Lecture notes of an intensive course at Universidad de Sevilla, Summer 2011.

Abstract. These expository lectures present a self-contained proof of the $A_2$
theorem (the sharp weighted norm inequality for Calderón-Zygmund operators
in $L^2(w)$), which is here formulated in such a way as to reveal some
additional information implicit in the earlier papers. This added data gives
at once a new weighted bound for powers of the Ahlfors-Beurling operator,
discussed at the end. A key ingredient of the $A_2$ theorem is the probabilistic
Dyadic Representation Theorem, for which a slightly simplified proof is given,
avoiding the conditional probabilities that were needed in the earlier arguments.

1. Introduction 
The goal of the lectures is to prove the following $A_2$ theorem:

1.1. Theorem. For any Calderón-Zygmund operator $T$ on $\mathbb{R}^d$, any $w \in A_2$, and
$f \in L^2(w)$, we have
\[ \|Tf\|_{L^2(w)} \le C_T [w]_{A_2} \|f\|_{L^2(w)}. \]

The proof will proceed via the following steps, in the same order: 

• Reduction to dyadic shift operators: every Calderón-Zygmund operator
$T$ has a representation in terms of these simpler operators, and hence it
suffices to prove a similar claim for every dyadic shift $S$ in place of $T$.

• Reduction to testing conditions: in order to have the full norm inequality
\[ \|Sf\|_{L^2(w)} \le C_S [w]_{A_2} \|f\|_{L^2(w)}, \]
it suffices to have such an inequality for special test functions only:
\[ \|S(1_Q w^{-1})\|_{L^2(w)} \le C_S [w]_{A_2} \|1_Q w^{-1}\|_{L^2(w)}, \]
\[ \|S^*(1_Q w)\|_{L^2(w^{-1})} \le C_S [w]_{A_2} \|1_Q w\|_{L^2(w^{-1})}. \]

• Verification of the testing conditions for $S$.

In the original proof of this theorem, in Summer 2010, the two reductions were
done in a different order: the (quite complicated) reduction to testing conditions was
obtained for general Calderón-Zygmund operators by Pérez-Treil-Volberg [18]; my
completion of the proof [8] then consisted of reducing these testing conditions for
$T$ to the testing conditions for $S$, and verifying the testing conditions for $S$ as
indicated in the last step above. The first two steps were assembled in the present
order in our joint work [10], simplifying the overall argument: the reduction to
dyadic operators and the verification of the testing conditions are essentially the
same, but the reduction to testing conditions is considerably simpler for the dyadic
operators. The actual verification of the testing conditions, both in [8, 10] and in the
present lectures, derives its main inspiration from the work of Lacey-Petermichl-Reguera [11].

Over the past year, different proofs and extensions of the A2 theorem have 
appeared; see the final section for a discussion and selected references. Some proofs 
do not proceed via the testing conditions, but all known proofs so far do need 
the reduction to dyadic operators. Some ingredients from the newer proofs will be 
exploited to round up corners of the presentation here and there in these lectures. 

2. Preliminaries 

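The standard (or reference) system of dyadic cubes is
\[ \mathscr{D}^0 := \{2^{-k}([0,1)^d + m) : k \in \mathbb{Z},\ m \in \mathbb{Z}^d\}. \]
We will need several dyadic systems, obtained by translating the reference system
as follows. Let $\omega = (\omega_j)_{j \in \mathbb{Z}} \in (\{0,1\}^d)^{\mathbb{Z}}$ and
\[ I \dot{+} \omega := I + \sum_{j : 2^{-j} < \ell(I)} 2^{-j} \omega_j. \]
Then
\[ \mathscr{D}^\omega := \{I \dot{+} \omega : I \in \mathscr{D}^0\}, \]
and it is straightforward to check that $\mathscr{D}^\omega$ inherits the important nestedness
property of $\mathscr{D}^0$: if $I, J \in \mathscr{D}^\omega$, then $I \cap J \in \{I, J, \emptyset\}$. When the particular $\omega$ is
unimportant, the notation $\mathscr{D}$ is sometimes used for a generic dyadic system.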
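The nestedness property can be checked by hand in a small case. The following is a minimal sketch in dimension $d = 1$, assuming a shift parameter $\omega$ with only finitely many nonzero components so that the defining sum is finite; the names `shifted_interval`, `intersection` and `is_nested` are illustrative and not part of the lectures.

```python
from fractions import Fraction
from itertools import product

def shifted_interval(k, m, omega):
    """Return I + omega for I = 2^-k [m, m+1), as a pair (left, right).

    omega maps an integer j to a bit in {0, 1}; missing entries count as 0,
    so the sum over all j with 2^-j < 2^-k (that is, j > k) is finite.
    """
    length = Fraction(1, 2) ** k
    shift = sum(Fraction(1, 2) ** j * bit for j, bit in omega.items() if j > k)
    left = length * m + shift
    return (left, left + length)

def intersection(a, b):
    """Intersection of two half-open intervals, or None if it is empty."""
    left, right = max(a[0], b[0]), min(a[1], b[1])
    return (left, right) if left < right else None

def is_nested(a, b):
    """True if the intersection of a and b is a, b, or empty."""
    return intersection(a, b) in (a, b, None)

# One fixed shift parameter (an arbitrary choice for illustration).
omega = {1: 1, 2: 0, 3: 1, 4: 1}

# All shifted intervals of side length 1, 1/2, 1/4, 1/8 near the origin.
cubes = [shifted_interval(k, m, omega)
         for k in range(0, 4) for m in range(-2 ** k, 2 ** k)]

all_nested = all(is_nested(a, b) for a, b in product(cubes, repeat=2))
print(all_nested)
```

The point of the construction is that a larger interval is shifted by the shift of the smaller one plus a multiple of the smaller side length, so its endpoints stay on the grid of the smaller generation. Mixing intervals from two different shifts, such as $[0,1)$ and $[1/2, 3/2)$, breaks nestedness.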

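2.A. Haar functions. Any given dyadic system $\mathscr{D}$ has a natural function system
associated to it: the Haar functions. In one dimension, there are two Haar functions
associated with an interval $I$: the non-cancellative $h_I^0 := |I|^{-1/2} 1_I$ and the
cancellative $h_I^1 := |I|^{-1/2}(1_{I_\ell} - 1_{I_r})$, where $I_\ell$ and $I_r$ are the left and right halves
of $I$. In $d$ dimensions, the Haar functions on a cube $I = I_1 \times \cdots \times I_d$ are formed of
all the products of the one-dimensional Haar functions:
\[ h_I^\eta(x) = h_{I_1 \times \cdots \times I_d}^{(\eta_1, \ldots, \eta_d)}(x_1, \ldots, x_d) := \prod_{i=1}^d h_{I_i}^{\eta_i}(x_i). \]
The non-cancellative $h_I^0 = |I|^{-1/2} 1_I$ has the same formula as in $d = 1$. All other
$2^d - 1$ Haar functions $h_I^\eta$ with $\eta \in \{0,1\}^d \setminus \{0\}$ are cancellative, i.e., satisfy $\int h_I^\eta = 0$,
since they are cancellative in at least one coordinate direction.

For a fixed $\mathscr{D}$, all the cancellative Haar functions $h_I^\eta$, $I \in \mathscr{D}$ and $\eta \in \{0,1\}^d \setminus \{0\}$,
form an orthonormal basis of $L^2(\mathbb{R}^d)$. Hence any function $f \in L^2(\mathbb{R}^d)$ has the
orthogonal expansion
\[ f = \sum_{I \in \mathscr{D}} \sum_{\eta \in \{0,1\}^d \setminus \{0\}} \langle f, h_I^\eta \rangle h_I^\eta. \]
Since the different $\eta$'s seldom play any major role, this will be often abbreviated
(with slight abuse of language) simply as
\[ f = \sum_{I \in \mathscr{D}} \langle f, h_I \rangle h_I, \]
and the summation over $\eta$ is understood implicitly.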
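As a sanity check of orthonormality and of the expansion, here is a minimal one-dimensional sketch. It works on the unit interval $[0,1)$ with functions that are constant on a grid of $2^N$ cells, which is an assumption made only to keep the computation finite. On this bounded model the mean of $f$ must be added back separately, since the cancellative Haar functions inside $[0,1)$ only span the mean-zero functions. The names `haar`, `inner` and `expand` are illustrative.

```python
import math

N = 4                 # functions are constant on cells of length 2^-N
CELLS = 2 ** N

def haar(k, m):
    """Sample the cancellative Haar function of I = 2^-k [m, m+1) on the grid."""
    size = CELLS >> k              # number of grid cells inside I
    start = m * size
    height = (2 ** k) ** 0.5       # |I|^(-1/2)
    values = [0.0] * CELLS
    for c in range(start, start + size):
        # +height on the left half of I, -height on the right half
        values[c] = height if c < start + size // 2 else -height
    return values

def inner(f, g):
    """L^2 inner product on [0, 1) of two grid functions."""
    return sum(a * b for a, b in zip(f, g)) / CELLS

# All cancellative Haar functions of side length at least 2 * 2^-N in [0, 1).
basis = [haar(k, m) for k in range(N) for m in range(2 ** k)]

gram_ok = all(
    math.isclose(inner(f, g), 1.0 if i == j else 0.0, abs_tol=1e-12)
    for i, f in enumerate(basis) for j, g in enumerate(basis)
)

def expand(f):
    """Rebuild f from its mean on [0, 1) and its Haar coefficients."""
    mean = sum(f) / CELLS
    rebuilt = [mean] * CELLS
    for h in basis:
        coeff = inner(f, h)
        rebuilt = [r + coeff * v for r, v in zip(rebuilt, h)]
    return rebuilt

sample = [float((3 * c * c + 1) % 7) for c in range(CELLS)]
expansion_ok = all(
    math.isclose(a, b, abs_tol=1e-9) for a, b in zip(sample, expand(sample))
)
print(gram_ok, expansion_ok)
```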
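2.B. Dyadic shifts. A dyadic shift with parameters $i, j \in \mathbb{N} := \{0, 1, 2, \ldots\}$ is an
operator of the form
\[ Sf = \sum_{K \in \mathscr{D}} A_K f, \qquad A_K f = \sum_{\substack{I, J \in \mathscr{D};\ I, J \subseteq K \\ \ell(I) = 2^{-i}\ell(K) \\ \ell(J) = 2^{-j}\ell(K)}} a_{IJK} \langle f, h_I \rangle h_J, \]
where $h_I$ is a Haar function on $I$ (similarly $h_J$), and the $a_{IJK}$ are coefficients with
\[ |a_{IJK}| \le \frac{\sqrt{|I||J|}}{|K|}. \]
It is also required that all subshifts
\[ S_{\mathscr{Q}} = \sum_{K \in \mathscr{Q}} A_K, \qquad \mathscr{Q} \subseteq \mathscr{D}, \]
map $S_{\mathscr{Q}} : L^2(\mathbb{R}^d) \to L^2(\mathbb{R}^d)$ with norm at most one.

The shift is called cancellative, if all the $h_I$ and $h_J$ are cancellative; otherwise,
it is called non-cancellative.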

The notation $A_K$ indicates an "averaging operator" on $K$. Indeed, from the
normalization of the Haar functions, it follows that
\[ |A_K f| \le 1_K \frac{1}{|K|} \int_K |f| \]
pointwise.

For cancellative shifts, the $L^2$ boundedness is automatic from the other conditions.
This is a consequence of the following facts:

• The pointwise bound for each $A_K$ implies that $\|A_K f\|_{L^p} \le \|f\|_{L^p}$ for all
$p \in [1, \infty]$; in particular, these components of $S$ are uniformly bounded on
$L^2$ with norm one. (This first point is true even in the non-cancellative case.)

• Let $\mathbb{D}_K^i$ denote the orthogonal projection of $L^2$ onto $\operatorname{span}\{h_I : I \subseteq K,\ \ell(I) = 2^{-i}\ell(K)\}$.
When $i$ is fixed, it follows readily that any two $\mathbb{D}_K^i$ are orthogonal
to each other. (This depends on the use of cancellative $h_I$.) Moreover,
we have $A_K = \mathbb{D}_K^j A_K \mathbb{D}_K^i$. Then the boundedness of $S$ follows from two
applications of Pythagoras' theorem with the uniform boundedness of the
$A_K$ in between.

A prime example of a non-cancellative shift (and the only one we need in these
lectures) is the dyadic paraproduct
\[ \Pi_b f = \sum_{K \in \mathscr{D}} \langle b, h_K \rangle \langle f \rangle_K h_K = \sum_{K \in \mathscr{D}} |K|^{-1/2} \langle b, h_K \rangle \cdot \langle f, h_K^0 \rangle h_K, \]
where $b \in \mathrm{BMO}_d$ (the dyadic BMO space) and $h_K$ is a cancellative Haar function.
This is a dyadic shift with parameters $(i,j) = (0,0)$, where $a_{IJK} = |K|^{-1/2} \langle b, h_K \rangle$
for $I = J = K$. The $L^2$ boundedness of the paraproduct, if and only if $b \in \mathrm{BMO}_d$,
is part of the classical theory. Actually, to ensure the normalization condition of
the shift, it should be further required that $\|b\|_{\mathrm{BMO}_d} \le 1$.
2.C. Random dyadic systems; good and bad cubes. We obtain a notion of
random dyadic systems by equipping the parameter set $\Omega := (\{0,1\}^d)^{\mathbb{Z}}$ with the
natural probability measure: each component $\omega_j$ has an equal probability $2^{-d}$ of
taking any of the $2^d$ values in $\{0,1\}^d$, and all components are independent of each
other.

Let $\phi : [0,1] \to [0,1]$ be a fixed modulus of continuity: a strictly increasing
function with $\phi(0) = 0$, $\phi(1) = 1$, and $t \mapsto \phi(t)/t$ decreasing (hence $\phi(t) \ge t$) with
$\lim_{t \to 0} \phi(t)/t = \infty$. We further require the Dini condition
\[ \int_0^1 \phi(t) \frac{dt}{t} < \infty. \]
Main examples include $\phi(t) = t^\gamma$ with $\gamma \in (0,1)$ and
\[ \phi(t) = \Big(1 + \frac{1}{\gamma} \log \frac{1}{t}\Big)^{-\gamma}, \qquad \gamma > 1. \]
We also fix a (large) parameter $r \in \mathbb{Z}_+$. (How large will be specified shortly.)
A cube $I \in \mathscr{D}^\omega$ is called bad if there exists $J \in \mathscr{D}^\omega$ such that $\ell(J) \ge 2^r \ell(I)$ and
\[ \operatorname{dist}(I, \partial J) \le \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \ell(J): \]
roughly, $I$ is relatively close to the boundary of a much bigger cube. A cube is called
good if it is not bad.

2.1. Remark. This definition of good cubes goes back to Nazarov-Treil-Volberg [15]
in the context of singular integrals with respect to non-doubling measures. They
used the modulus of continuity $\phi(t) = t^\gamma$, where $\gamma$ was chosen to depend on the
dimension $d$ and the Hölder exponent $\alpha$ of the Calderón-Zygmund kernel via
\[ \gamma = \frac{\alpha}{2(d + \alpha)}. \]
This choice has become "canonical" in the subsequent literature, including the
original proof of the $A_2$ theorem. However, other choices can also be made, as we do
here.

We make some basic probabilistic observations related to badness. Let $I \in \mathscr{D}^0$
be a reference interval. The position of the translated interval
\[ I \dot{+} \omega = I + \sum_{j : 2^{-j} < \ell(I)} 2^{-j} \omega_j, \]
by definition, depends only on $\omega_j$ for $2^{-j} < \ell(I)$. On the other hand, the badness
of $I \dot{+} \omega$ depends on its relative position with respect to the bigger intervals
\[ J \dot{+} \omega = J + \sum_{j : 2^{-j} < \ell(I)} 2^{-j} \omega_j + \sum_{j : \ell(I) \le 2^{-j} < \ell(J)} 2^{-j} \omega_j. \]
The same translation component $\sum_{j : 2^{-j} < \ell(I)} 2^{-j} \omega_j$ appears in both $I \dot{+} \omega$ and
$J \dot{+} \omega$, and so does not affect the relative position of these intervals. Thus this
relative position, and hence the badness of $I$, depends only on $\omega_j$ for $2^{-j} \ge \ell(I)$.
In particular:

2.2. Lemma. For $I \in \mathscr{D}^0$, the position and badness of $I \dot{+} \omega$ are independent random
variables.
Another observation is the following: by symmetry and the fact that the condition
of badness only involves relative position and size of different cubes, it readily
follows that the probability of a particular cube $I \dot{+} \omega$ being bad is equal for all cubes
$I \in \mathscr{D}^0$:
\[ \mathbb{P}_\omega(I \dot{+} \omega \text{ bad}) = \pi_{bad} = \pi_{bad}(r, d, \phi). \]
The final observation concerns the value of this probability:

2.3. Lemma. We have
\[ \pi_{bad} \le 8d \int_0^{2^{-r}} \phi(t) \frac{dt}{t}; \]
in particular, $\pi_{bad} < 1$ if $r = r(d, \phi)$ is chosen large enough.

With $r = r(d, \phi)$ chosen like this, we then have $\pi_{good} := 1 - \pi_{bad} > 0$, namely,
good situations have positive probability!

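Proof. Observe that in the definition of badness, we only need to consider those $J$
with $I \subseteq J$. Namely, if $I$ is close to the boundary of some bigger $J$, we can always
find another dyadic $J'$ of the same size as $J$ which contains $I$, and then $I$ will also
be close to the boundary of $J'$. Hence we need to consider the relative position of
$I$ with respect to each $J \supset I$ with $\ell(J) = 2^k \ell(I)$ and $k = r, r+1, \ldots$ For a fixed $k$,
this relative position is determined by
\[ \sum_{j : \ell(I) \le 2^{-j} < 2^k \ell(I)} 2^{-j} \omega_j, \]
which has $2^{kd}$ different values with equal probability. These correspond to the
subcubes of $J$ of size $\ell(I)$.

Now the bad positions of $I$ are those which are within distance $\phi(\ell(I)/\ell(J)) \cdot \ell(J)$
from the boundary. Since the possible positions of the subcubes are discrete, being
integer multiples of $\ell(I)$, the effective bad boundary region has depth
\[ \Big\lceil \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \frac{\ell(J)}{\ell(I)} \Big\rceil \ell(I)
   \le \Big( \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \frac{\ell(J)}{\ell(I)} + 1 \Big) \ell(I)
   \le 2 \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \frac{\ell(J)}{\ell(I)} \ell(I)
   = 2 \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \ell(J), \]
by using that $t \le \phi(t)$.

The good region is the cube inside $J$, whose side-length is $\ell(J)$ minus twice the
depth of the bad boundary region:
\[ \ell(J) - 2 \Big\lceil \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \frac{\ell(J)}{\ell(I)} \Big\rceil \ell(I)
   \ge \ell(J) - 4 \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \ell(J). \]
Hence the volume of the bad region is
\[ |J| - \Big( \ell(J) - 2 \Big\lceil \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \frac{\ell(J)}{\ell(I)} \Big\rceil \ell(I) \Big)^d
   \le |J| \Big( 1 - \Big( 1 - 4 \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \Big)^d \Big)
   \le |J| \cdot 4d\, \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \]
by the elementary inequality $(1 - \alpha)^d \ge 1 - \alpha d$ for $\alpha \in [0,1]$. (We assume that $r$ is
at least so large that $4\phi(2^{-r}) \le 1$.)

So the fraction of the bad region of the total volume is at most $4d\,\phi(\ell(I)/\ell(J)) = 4d\,\phi(2^{-k})$
for a fixed $k = r, r+1, \ldots$ This gives the final estimate
\[ \mathbb{P}_\omega(I \dot{+} \omega \text{ bad}) \le \sum_{k=r}^\infty 4d\, \phi(2^{-k})
   = \sum_{k=r}^\infty 8d\, \frac{\phi(2^{-k})}{2^{-k}} 2^{-k-1}
   \le \sum_{k=r}^\infty 8d \int_{2^{-k-1}}^{2^{-k}} \frac{\phi(t)}{t}\, dt
   = 8d \int_0^{2^{-r}} \phi(t) \frac{dt}{t}, \]
where we used that $\phi(t)/t$ is decreasing in the last inequality. □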
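For the power modulus $\phi(t) = t^\gamma$ the integral in Lemma 2.3 can be evaluated explicitly, $8d \int_0^{2^{-r}} t^{\gamma} \, dt/t = 8d\, 2^{-r\gamma}/\gamma$, which makes it easy to see how large $r$ must be. The sketch below, with illustrative function names that are not from the lectures, compares this bound with a truncation of the series $\sum_{k \ge r} 4d\,\phi(2^{-k})$ from the proof and searches for the smallest $r$ meeting both requirements used there.

```python
def bad_probability_bound(r, d, gamma):
    """Right-hand side of Lemma 2.3 for phi(t) = t**gamma, 0 < gamma < 1."""
    return 8 * d * 2.0 ** (-r * gamma) / gamma

def bad_probability_series(r, d, gamma, terms=200):
    """Truncated series sum_{k >= r} 4 d phi(2^-k) from the proof."""
    return sum(4 * d * 2.0 ** (-k * gamma) for k in range(r, r + terms))

def smallest_admissible_r(d, gamma):
    """Smallest r with 4 * phi(2^-r) <= 1 and the Lemma 2.3 bound below 1."""
    r = 1
    while 4 * 2.0 ** (-r * gamma) > 1 or bad_probability_bound(r, d, gamma) >= 1:
        r += 1
    return r

r = smallest_admissible_r(d=1, gamma=0.5)
print(r, bad_probability_series(r, 1, 0.5), bad_probability_bound(r, 1, 0.5))
```

In dimension one with $\gamma = 1/2$ the bound equals $16 \cdot 2^{-r/2}$, which is exactly $1$ at $r = 8$ and drops below $1$ from $r = 9$ on.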

3. The dyadic representation theorem 

Let $T$ be a Calderón-Zygmund operator on $\mathbb{R}^d$. That is, it acts on a suitable
dense subspace of functions in $L^2(\mathbb{R}^d)$ (for the present purposes, this class should
at least contain the indicators of cubes in $\mathbb{R}^d$) and has the kernel representation
\[ Tf(x) = \int_{\mathbb{R}^d} K(x,y) f(y)\, dy, \qquad x \notin \operatorname{supp} f. \]
Moreover, the kernel should satisfy the standard estimates, which we here assume in
a slightly more general form than usual, involving another modulus of continuity $\psi$,
like the one considered above:
\[ |K(x,y)| \le \frac{C_0}{|x-y|^d}, \]
\[ |K(x,y) - K(x',y)| + |K(y,x) - K(y,x')| \le \frac{C_\psi}{|x-y|^d}\, \psi\Big(\frac{|x-x'|}{|x-y|}\Big) \]
for all $x, x', y \in \mathbb{R}^d$ with $|x-y| > 2|x-x'|$. Let us denote the smallest admissible
constants $C_0$ and $C_\psi$ by $\|K\|_{CZ_0}$ and $\|K\|_{CZ_\psi}$. The classical standard estimates
correspond to the choice $\psi(t) = t^\alpha$, $\alpha \in (0,1]$, in which case we write $\|K\|_{CZ_\alpha}$ for
$\|K\|_{CZ_\psi}$.

We say that $T$ is a bounded Calderón-Zygmund operator, if in addition $T :
L^2(\mathbb{R}^d) \to L^2(\mathbb{R}^d)$, and we denote its operator norm by $\|T\|_{L^2 \to L^2}$.

Let us agree that $|\cdot|$ stands for the $\ell^\infty$ norm on $\mathbb{R}^d$, i.e., $|x| := \max_{1 \le i \le d} |x_i|$.
While the choice of the norm is not particularly important, this choice is slightly
more convenient than the usual Euclidean norm when dealing with cubes as we
will: e.g., the diameter of a cube in the $\ell^\infty$ norm is equal to its sidelength $\ell(Q)$.

Let us first formulate the dyadic representation theorem for general moduli of
continuity, and then specialize it to the usual standard estimates. Define the
following coefficients for $i, j \in \mathbb{N}$:
\[ \tau(i,j) := \phi(2^{-\max\{i,j\}})^{-d}\, \psi\big(2^{-\max\{i,j\}} \phi(2^{-\max\{i,j\}})^{-1}\big), \]
if $\min\{i,j\} > 0$; and
\[ \tau(i,j) := \Psi\big(2^{-\max\{i,j\}} \phi(2^{-\max\{i,j\}})^{-1}\big), \qquad \Psi(t) := \int_0^t \psi(s) \frac{ds}{s}, \]
if $\min\{i,j\} = 0$.

We assume that $\phi$ and $\psi$ are such, that
\[ (3.1) \qquad \sum_{i,j=0}^\infty \tau(i,j) \eqsim \int_0^1 \frac{1}{\phi(t)^d}\, \psi\Big(\frac{t}{\phi(t)}\Big) \frac{dt}{t} + \int_0^1 \Psi\Big(\frac{t}{\phi(t)}\Big) \frac{dt}{t} < \infty. \]
This is the case, in particular, when $\psi(t) = t^\alpha$ (usual standard estimates) and
$\phi(t) = (1 + a^{-1} \log t^{-1})^{-\gamma}$; then one checks that
\[ \tau(i,j) \lesssim P(\max\{i,j\})\, 2^{-\alpha \max\{i,j\}}, \qquad P(j) = (1 + j)^{\gamma(d+\alpha)}, \]
which clearly satisfies the required convergence. However, it is also possible to
treat weaker forms of the standard estimates with a logarithmic modulus $\psi(t) \eqsim
(1 + \log t^{-1})^{-\alpha}$. This might be of some interest for applications, but we do not
pursue this line any further here.

3.2. Theorem. Let $T$ be a bounded Calderón-Zygmund operator with modulus of
continuity satisfying the above assumption. Then it has an expansion, say for
$f, g \in C_c^1(\mathbb{R}^d)$,
\[ \langle g, Tf \rangle = c \cdot (\|T\|_{L^2 \to L^2} + \|K\|_{CZ_\psi}) \cdot \mathbb{E}_\omega \sum_{i,j=0}^\infty \tau(i,j) \langle g, S_\omega^{ij} f \rangle, \]
where $c$ is a dimensional constant and $S_\omega^{ij}$ is a dyadic shift of parameters $(i,j)$ on
the dyadic system $\mathscr{D}^\omega$; all of them except possibly $S_\omega^{00}$ are cancellative.

The first version of this theorem appeared in [3], and another one in [10]. The
present proof is yet another variant of the same argument. It is slightly simpler
in terms of the probabilistic tools that are used: no conditional probabilities are
needed, although they were important for the original arguments.

In proving this theorem, we do not actually need to employ the full strength of
the assumption that $T : L^2(\mathbb{R}^d) \to L^2(\mathbb{R}^d)$; rather it suffices to have the kernel
conditions plus the following conditions of the $T1$ theorem of David-Journé:
\[ |\langle 1_Q, T 1_Q \rangle| \le C_{WBP} |Q| \qquad \text{(weak boundedness property)}, \]
\[ T1 \in \mathrm{BMO}(\mathbb{R}^d), \qquad T^*1 \in \mathrm{BMO}(\mathbb{R}^d). \]
Let us denote the smallest $C_{WBP}$ by $\|T\|_{WBP}$. Then we have the following more
precise version of the representation:

3.3. Theorem. Let $T$ be a Calderón-Zygmund operator with modulus of continuity
satisfying the above assumption. Then it has an expansion, say for $f, g \in C_c^1(\mathbb{R}^d)$,
\[ \langle g, Tf \rangle = c \cdot (\|K\|_{CZ_0} + \|K\|_{CZ_\psi})\, \mathbb{E}_\omega \sum_{\substack{i,j=0 \\ \max\{i,j\} > 0}}^\infty \tau(i,j) \langle g, S_\omega^{ij} f \rangle \]
\[ \qquad + c \cdot (\|K\|_{CZ_0} + \|T\|_{WBP})\, \mathbb{E}_\omega \langle g, S_\omega^{00} f \rangle + \mathbb{E}_\omega \langle g, \Pi_{T1}^\omega f \rangle + \mathbb{E}_\omega \langle g, (\Pi_{T^*1}^\omega)^* f \rangle, \]
where $S_\omega^{ij}$ is a cancellative dyadic shift of parameters $(i,j)$ on the dyadic system
$\mathscr{D}^\omega$, and $\Pi_b^\omega$ is a dyadic paraproduct on the dyadic system $\mathscr{D}^\omega$ associated with the
BMO-function $b \in \{T1, T^*1\}$.

3.4. Remark. Note that $\Pi_b^\omega = \|b\|_{\mathrm{BMO}} \cdot S_b^\omega$, where $S_b^\omega = \Pi_b^\omega / \|b\|_{\mathrm{BMO}}$ is a shift
with the correct normalization. Hence, writing everything in terms of normalized
shifts, as in Theorem 3.2, we get the factor $\|T1\|_{\mathrm{BMO}} \lesssim \|T\|_{L^2 \to L^2} + \|K\|_{CZ_\psi}$
in the second-to-last term, and $\|T^*1\|_{\mathrm{BMO}} \lesssim \|T\|_{L^2 \to L^2} + \|K\|_{CZ_\psi}$ in the last
one. The proof will also show that both occurrences of the factor $\|K\|_{CZ_0}$ could
be replaced by $\|T\|_{L^2 \to L^2}$, giving the statement of Theorem 3.2 (since trivially
$\|T\|_{WBP} \le \|T\|_{L^2 \to L^2}$).






T. P. HYTONEN 



As a by-product, Theorem 3.3 delivers a proof of the $T1$ theorem: under the
above assumptions, the operator $T$ is already bounded on $L^2(\mathbb{R}^d)$. Namely, all
the dyadic shifts $S_\omega^{ij}$ are uniformly bounded on $L^2(\mathbb{R}^d)$ by definition, and the
convergence condition (3.1) ensures that so is their average representing the operator
$T$. This by-product proof of the $T1$ theorem is not a coincidence, since the proof
of Theorems 3.2 and 3.3 was actually inspired by the proof of the $T1$ theorem
for non-doubling measures due to Nazarov-Treil-Volberg [15] and its vector-valued
extension [S].

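A key to the proof of the dyadic representation is a random expansion of $T$ in
terms of Haar functions $h_I$, where the bad cubes are avoided:

3.5. Proposition.
\[ \langle g, Tf \rangle = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_{I, J \in \mathscr{D}^\omega} 1_{good}(\operatorname{smaller}\{I, J\}) \cdot \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle, \]
where
\[ \operatorname{smaller}\{I, J\} := \begin{cases} I & \text{if } \ell(I) \le \ell(J), \\ J & \text{if } \ell(J) < \ell(I). \end{cases} \]

Proof. Recall that
\[ f = \sum_{I \in \mathscr{D}^0} \langle f, h_{I \dot{+} \omega} \rangle h_{I \dot{+} \omega} \]
for any fixed $\omega \in \Omega$; and we can also take the expectation $\mathbb{E}_\omega$ of both sides of this
identity.
Let
\[ 1_{good}(I \dot{+} \omega) := \begin{cases} 1, & \text{if } I \dot{+} \omega \text{ is good}, \\ 0, & \text{else}. \end{cases} \]
We make use of the above random Haar expansion of $f$, multiply and divide by
\[ \pi_{good} = \mathbb{P}_\omega(I \dot{+} \omega \text{ good}) = \mathbb{E}_\omega 1_{good}(I \dot{+} \omega), \]
and use the independence from Lemma 2.2 to get:
\[ \langle g, Tf \rangle = \mathbb{E}_\omega \sum_I \langle g, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
\[ = \frac{1}{\pi_{good}} \sum_I \mathbb{E}_\omega[1_{good}(I \dot{+} \omega)]\, \mathbb{E}_\omega[\langle g, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle] \]
\[ = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_I 1_{good}(I \dot{+} \omega) \langle g, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
\[ = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_{I, J} 1_{good}(I \dot{+} \omega) \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle. \]
On the other hand, using independence again in half of this double sum, we have
\[ \frac{1}{\pi_{good}} \sum_{\ell(I) > \ell(J)} \mathbb{E}_\omega[1_{good}(I \dot{+} \omega) \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle] \]
\[ = \frac{1}{\pi_{good}} \sum_{\ell(I) > \ell(J)} \mathbb{E}_\omega[1_{good}(I \dot{+} \omega)]\, \mathbb{E}_\omega[\langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle] \]
\[ = \mathbb{E}_\omega \sum_{\ell(I) > \ell(J)} \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle, \]
and hence
\[ \langle g, Tf \rangle = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_{\ell(I) \le \ell(J)} 1_{good}(I \dot{+} \omega) \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
\[ \qquad + \mathbb{E}_\omega \sum_{\ell(I) > \ell(J)} \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle. \]
Comparison with the basic identity
\[ (3.6) \qquad \langle g, Tf \rangle = \mathbb{E}_\omega \sum_{I, J} \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
shows that
\[ \mathbb{E}_\omega \sum_{\ell(I) \le \ell(J)} \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
\[ = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_{\ell(I) \le \ell(J)} 1_{good}(I \dot{+} \omega) \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle. \]
Symmetrically, we also have
\[ \mathbb{E}_\omega \sum_{\ell(I) > \ell(J)} \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle \]
\[ = \frac{1}{\pi_{good}} \mathbb{E}_\omega \sum_{\ell(I) > \ell(J)} 1_{good}(J \dot{+} \omega) \langle g, h_{J \dot{+} \omega} \rangle \langle h_{J \dot{+} \omega}, T h_{I \dot{+} \omega} \rangle \langle h_{I \dot{+} \omega}, f \rangle, \]
and this completes the proof. □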

This is essentially the end of probability in this proof. Henceforth, we can simply
concentrate on the summation inside $\mathbb{E}_\omega$, for a fixed value of $\omega \in \Omega$, and manipulate
it into the required form. Moreover, we will concentrate on the half of the sum with
$\ell(J) \ge \ell(I)$, the other half being handled symmetrically. We further divide this
sum into the following parts:
\[ \sum_{\ell(I) \le \ell(J)} = \sum_{\operatorname{dist}(I,J) > \ell(J)\phi(\ell(I)/\ell(J))} + \sum_{I \subsetneq J} + \sum_{I = J} + \sum_{\substack{\operatorname{dist}(I,J) \le \ell(J)\phi(\ell(I)/\ell(J)) \\ I \cap J = \emptyset}} \]
\[ =: \sigma_{out} + \sigma_{in} + \sigma_{=} + \sigma_{near}. \]
In order to recognize these series as sums of dyadic shifts, we need to locate, for
each pair $(I, J)$ appearing here, a common dyadic ancestor which contains both of
them. The existence of such containing cubes, with control on their size, is provided
by the following:
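3.7. Lemma. If $I \in \mathscr{D}$ is good and $J \in \mathscr{D}$ is a disjoint ($J \cap I = \emptyset$) cube with
$\ell(J) \ge \ell(I)$, then there exists $K \supseteq I \cup J$ which satisfies
\[ \ell(K) \le 2^r \ell(I), \qquad \text{if } \operatorname{dist}(I,J) \le \ell(J) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big), \]
\[ \ell(K) \phi\Big(\frac{\ell(I)}{\ell(K)}\Big) \le 2^r \operatorname{dist}(I,J), \qquad \text{if } \operatorname{dist}(I,J) > \ell(J) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big). \]

Proof. Let us start with the following initial observation: if $K \in \mathscr{D}$ satisfies $I \subseteq K$,
$J \subset K^c$, and $\ell(K) \ge 2^r \ell(I)$, then
\[ \ell(K) \phi\Big(\frac{\ell(I)}{\ell(K)}\Big) < \operatorname{dist}(I, \partial K) = \operatorname{dist}(I, K^c) \le \operatorname{dist}(I, J). \]

Case $\operatorname{dist}(I,J) \le \ell(J)\phi(\ell(I)/\ell(J))$. As $I \cap J = \emptyset$, we have $\operatorname{dist}(I,J) = \operatorname{dist}(I, \partial J)$,
and since $I$ is good, this implies $\ell(J) < 2^r \ell(I)$. Let $K = I^{(r)}$, and assume for
contradiction that $J \subset K^c$. Then the initial observation implies that
\[ \ell(K) \phi\Big(\frac{\ell(I)}{\ell(K)}\Big) < \operatorname{dist}(I,J) \le \ell(J) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big). \]
Dividing both sides by $\ell(I)$ and recalling that $\phi(t)/t$ is decreasing, this implies that
$\ell(K) < \ell(J)$, a contradiction with $\ell(K) = 2^r \ell(I) > \ell(J)$. Hence $J \not\subset K^c$, and since
$\ell(J) < \ell(K)$, this implies that $J \subset K$.

Case $\operatorname{dist}(I,J) > \ell(J)\phi(\ell(I)/\ell(J))$. Consider the minimal $K \supset I$ with $\ell(K) \ge
2^r \ell(I)$ and $\operatorname{dist}(I,J) \le \ell(K)\phi(\ell(I)/\ell(K))$. (Since $\phi(t)/t \to \infty$ as $t \to 0$, this bound
holds for all large enough $K$.) Then (since $\phi(t)/t$ is decreasing) $\ell(K) > \ell(J)$, and
by the initial observation, $J \not\subset K^c$. Hence $J \subset K$, and it suffices to estimate
$\ell(K)$.

By the minimality of $K$, there holds at least one of
\[ \tfrac{1}{2} \ell(K) < 2^r \ell(I) \qquad \text{or} \qquad \tfrac{1}{2} \ell(K) \phi\Big(\frac{\ell(I)}{\tfrac{1}{2}\ell(K)}\Big) < \operatorname{dist}(I,J), \]
and the latter immediately implies that $\ell(K)\phi(\ell(I)/\ell(K)) < 2 \operatorname{dist}(I,J)$. In the
first case, since $\ell(I) \le \ell(J) \le \ell(K)$, we have
\[ \ell(K) \phi\Big(\frac{\ell(I)}{\ell(K)}\Big) \le 2^r \ell(I) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) \le 2^r \ell(J) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big) < 2^r \operatorname{dist}(I,J), \]
so the required bound is true in each case. □

We denote the minimal such $K$ by $I \vee J$, thus
\[ I \vee J := \bigcap_{K \supseteq I \cup J} K. \]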

3.A. Separated cubes, $\sigma_{out}$. We reorganize the sum $\sigma_{out}$ with respect to the new
summation variable $K = I \vee J$, as well as the relative size of $I$ and $J$ with respect
to $K$:
\[ \sigma_{out} = \sum_{j=1}^\infty \sum_{i=j}^\infty \sum_K \Bigg( \sum_{\substack{\operatorname{dist}(I,J) > \ell(J)\phi(\ell(I)/\ell(J)) \\ I \vee J = K \\ \ell(I) = 2^{-i}\ell(K),\ \ell(J) = 2^{-j}\ell(K)}} 1_{good}(I) \cdot \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle \Bigg). \]
Note that we can start the summation from 1 instead of 0, since the disjointness
of $I$ and $J$ implies that $K = I \vee J$ must be strictly larger than either of $I$ and
$J$. The goal is to identify the quantity in parentheses as a decaying factor times a
cancellative averaging operator with parameters $(i, j)$.



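3.8. Lemma. For $I$ and $J$ appearing in $\sigma_{out}$, we have
\[ |\langle h_J, T h_I \rangle| \lesssim \|K\|_{CZ_\psi} \frac{\sqrt{|I||J|}}{|K|}\, \phi\Big(\frac{\ell(I)}{\ell(K)}\Big)^{-d} \psi\Big(\frac{\ell(I)}{\ell(K)} \phi\Big(\frac{\ell(I)}{\ell(K)}\Big)^{-1}\Big), \qquad K = I \vee J. \]

Proof. Using the cancellation of $h_I$, standard estimates, and Lemma 3.7, with $y_I$
denoting the centre of $I$,
\[ |\langle h_J, T h_I \rangle| = \Big| \iint h_J(x) K(x,y) h_I(y)\, dy\, dx \Big| \]
\[ = \Big| \iint h_J(x) [K(x,y) - K(x,y_I)] h_I(y)\, dy\, dx \Big| \]
\[ \lesssim \|K\|_{CZ_\psi} \iint |h_J(x)| \frac{1}{\operatorname{dist}(I,J)^d}\, \psi\Big(\frac{\ell(I)}{\operatorname{dist}(I,J)}\Big) |h_I(y)|\, dy\, dx \]
\[ \lesssim \|K\|_{CZ_\psi} \|h_J\|_1 \frac{1}{\ell(K)^d}\, \phi\Big(\frac{\ell(I)}{\ell(K)}\Big)^{-d} \psi\Big(\frac{\ell(I)}{\ell(K)} \phi\Big(\frac{\ell(I)}{\ell(K)}\Big)^{-1}\Big) \|h_I\|_1, \]
and $\|h_J\|_1 \|h_I\|_1 = \sqrt{|I||J|}$, $\ell(K)^d = |K|$. □

3.9. Lemma.
\[ \sum_{\substack{\operatorname{dist}(I,J) > \ell(J)\phi(\ell(I)/\ell(J)) \\ I \vee J = K \\ \ell(I) = 2^{-i}\ell(K) \le \ell(J) = 2^{-j}\ell(K)}} 1_{good}(I) \cdot \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle
   = \|K\|_{CZ_\psi}\, \phi(2^{-i})^{-d} \psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, A_K^{ij} f \rangle, \]
where $A_K^{ij}$ is a cancellative averaging operator with parameters $(i,j)$.

Proof. By the previous lemma, substituting $\ell(I)/\ell(K) = 2^{-i}$,
\[ |\langle h_J, T h_I \rangle| \lesssim \|K\|_{CZ_\psi} \frac{\sqrt{|I||J|}}{|K|}\, \phi(2^{-i})^{-d} \psi\big(2^{-i} \phi(2^{-i})^{-1}\big), \]
and the first factor is precisely the required size of the coefficients of $A_K^{ij}$. □

Summarizing, we have
\[ \sigma_{out} = \|K\|_{CZ_\psi} \sum_{j=1}^\infty \sum_{i=j}^\infty \phi(2^{-i})^{-d} \psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, S^{ij} f \rangle. \]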

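3.B. Contained cubes, $\sigma_{in}$. When $I \subsetneq J$, then $I$ is contained in some subcube
of $J$, which we denote by $J_I$. We write
\[ \langle h_J, T h_I \rangle = \langle 1_{J_I^c} h_J, T h_I \rangle + \langle 1_{J_I} h_J, T h_I \rangle \]
\[ = \langle 1_{J_I^c} h_J, T h_I \rangle + \langle h_J \rangle_{J_I} \langle 1_{J_I}, T h_I \rangle \]
\[ = \langle 1_{J_I^c} (h_J - \langle h_J \rangle_{J_I}), T h_I \rangle + \langle h_J \rangle_I \langle 1, T h_I \rangle, \]
where we noticed that $h_J$ is constant on $J_I \supseteq I$.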
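3.10. Lemma.
\[ |\langle 1_{J_I^c} (h_J - \langle h_J \rangle_{J_I}), T h_I \rangle| \lesssim (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) \Big(\frac{|I|}{|J|}\Big)^{1/2} \Psi\Big(\frac{\ell(I)}{\ell(J)} \phi\Big(\frac{\ell(I)}{\ell(J)}\Big)^{-1}\Big), \]
where
\[ \Psi(r) := \int_0^r \psi(t) \frac{dt}{t}, \]
and $\|K\|_{CZ_0}$ could be alternatively replaced by $\|T\|_{L^2 \to L^2}$.

Proof.
\[ |\langle 1_{J_I^c} (h_J - \langle h_J \rangle_{J_I}), T h_I \rangle| \le 2 \|h_J\|_\infty \int_{J_I^c} |T h_I(x)|\, dx, \]
where $\|h_J\|_\infty = |J|^{-1/2}$.

Case $\ell(I) \ge 2^{-r} \ell(J)$. We have
\[ \int_{J_I^c} |T h_I(x)|\, dx \le \int_{3I \setminus I} \Big| \int K(x,y) h_I(y)\, dy \Big|\, dx + \int_{(3I)^c} \Big| \int [K(x,y) - K(x,y_I)] h_I(y)\, dy \Big|\, dx \]
\[ \lesssim \|K\|_{CZ_0} \int_{3I \setminus I} \int_I \frac{1}{|x-y|^d}\, dy\, dx\, \|h_I\|_\infty + \|K\|_{CZ_\psi} \int_{(3I)^c} \frac{1}{\operatorname{dist}(x,I)^d}\, \psi\Big(\frac{\ell(I)}{\operatorname{dist}(x,I)}\Big) dx\, \|h_I\|_1 \]
\[ \lesssim \|K\|_{CZ_0} |I| \|h_I\|_\infty + \|K\|_{CZ_\psi} \int_{\ell(I)}^\infty \frac{1}{r^d}\, \psi\Big(\frac{\ell(I)}{r}\Big) r^{d-1}\, dr\, \|h_I\|_1 \]
\[ = \|K\|_{CZ_0} |I|^{1/2} + \|K\|_{CZ_\psi} \int_0^1 \psi(s) \frac{ds}{s}\, |I|^{1/2} \]
\[ \lesssim (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) |I|^{1/2} \]
by the Dini condition for $\psi$ in the last step.

Alternatively, the part giving the factor $\|K\|_{CZ_0}$ could have been estimated by
\[ \int_{3I \setminus I} \Big| \int K(x,y) h_I(y)\, dy \Big|\, dx \le |3I|^{1/2} \|T h_I\|_{L^2} \lesssim |I|^{1/2} \|T\|_{L^2 \to L^2}. \]

Case $\ell(I) < 2^{-r} \ell(J)$. Since $I \subset J_I$ is good, we have
\[ \operatorname{dist}(I, J_I^c) > \ell(J_I) \phi\Big(\frac{\ell(I)}{\ell(J_I)}\Big) \gtrsim \ell(J) \phi\Big(\frac{\ell(I)}{\ell(J)}\Big), \]
and hence
\[ \int_{J_I^c} |T h_I(x)|\, dx \lesssim \|K\|_{CZ_\psi} \int_{J_I^c} \frac{1}{\operatorname{dist}(x,I)^d}\, \psi\Big(\frac{\ell(I)}{\operatorname{dist}(x,I)}\Big) dx\, \|h_I\|_1 \]
\[ \lesssim \|K\|_{CZ_\psi} \int_{\ell(J)\phi(\ell(I)/\ell(J))}^\infty \frac{1}{r^d}\, \psi\Big(\frac{\ell(I)}{r}\Big) r^{d-1}\, dr\, \|h_I\|_1 \]
\[ = \|K\|_{CZ_\psi} \int_0^{\frac{\ell(I)}{\ell(J)} \phi(\ell(I)/\ell(J))^{-1}} \psi(s) \frac{ds}{s}\, |I|^{1/2}. \]
□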
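Now we can organize
\[ \sigma_{in}' := \sum_J \sum_{I \subsetneq J} \langle g, h_J \rangle \langle 1_{J_I^c} (h_J - \langle h_J \rangle_{J_I}), T h_I \rangle \langle h_I, f \rangle = \sum_{i=1}^\infty \sum_J \sum_{\substack{I \subset J \\ \ell(I) = 2^{-i}\ell(J)}} \langle g, h_J \rangle \langle 1_{J_I^c} (h_J - \langle h_J \rangle_{J_I}), T h_I \rangle \langle h_I, f \rangle, \]
and the inner sum is recognized as
\[ (\|K\|_{CZ_0} + \|K\|_{CZ_\psi})\, \Psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, A_J^{i0} f \rangle, \]
or with $\|T\|_{L^2 \to L^2}$ in place of $\|K\|_{CZ_0}$, for a cancellative averaging operator $A_J^{i0}$ of type
$(i, 0)$.

On the other hand,
\[ \sigma_{in}'' := \sum_J \sum_{I \subsetneq J} \langle g, h_J \rangle \langle h_J \rangle_I \langle 1, T h_I \rangle \langle h_I, f \rangle \]
\[ = \sum_I \Big( \sum_{J \supsetneq I} \langle g, h_J \rangle \langle h_J \rangle_I \Big) \langle T^*1, h_I \rangle \langle h_I, f \rangle \]
\[ = \sum_I \langle g \rangle_I \langle T^*1, h_I \rangle \langle h_I, f \rangle \]
\[ = \Big\langle \sum_I \langle g \rangle_I \langle T^*1, h_I \rangle h_I, f \Big\rangle =: \langle \Pi_{T^*1} g, f \rangle = \langle g, \Pi_{T^*1}^* f \rangle. \]
Here $\Pi_{T^*1}$ is the paraproduct, a non-cancellative shift composed of the non-cancellative
averaging operators
\[ A_I g = \langle T^*1, h_I \rangle \langle g \rangle_I h_I = |I|^{-1/2} \langle T^*1, h_I \rangle \cdot \langle g, h_I^0 \rangle h_I \]
of type $(0,0)$.

Summarizing, we have
\[ \sigma_{in} = \sigma_{in}' + \sigma_{in}'' = (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) \sum_{i=1}^\infty \Psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, S^{i0} f \rangle + \langle g, \Pi_{T^*1}^* f \rangle, \]
where $\Psi(t) = \int_0^t \psi(s)\, ds/s$, and $\|K\|_{CZ_0}$ could be replaced by $\|T\|_{L^2 \to L^2}$. Note
that if we wanted to write $\Pi_{T^*1}^*$ in terms of a shift with correct normalization, we
should divide and multiply it by $\|T^*1\|_{\mathrm{BMO}}$, thus getting a shift times the factor
\[ \|T^*1\|_{\mathrm{BMO}} \lesssim \|T\|_{L^2 \to L^2} + \|K\|_{CZ_\psi}. \]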

3.C. Near-by cubes, $\sigma_{=}$ and $\sigma_{near}$. We are left with the sums $\sigma_{=}$ of equal cubes
$I = J$, as well as $\sigma_{near}$ of disjoint near-by cubes with $\operatorname{dist}(I,J) \le \ell(J)\phi(\ell(I)/\ell(J))$.
Since $I$ is good, this necessarily implies that $\ell(I) > 2^{-r} \ell(J)$. Then, for a given $J$,
there are only boundedly many related $I$ in this sum.

3.11. Lemma.
\[ |\langle h_J, T h_I \rangle| \lesssim \|K\|_{CZ_0} + \delta_{IJ} \|T\|_{WBP}. \]

Note that if we used the $L^2$-boundedness of $T$ instead of the $CZ_0$ and WBP
conditions (as is done in Theorem 3.2), we could also estimate simply
\[ |\langle h_J, T h_I \rangle| \le \|h_J\|_2 \|T\|_{L^2 \to L^2} \|h_I\|_2 = \|T\|_{L^2 \to L^2}. \]
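Proof. For disjoint cubes, we estimate directly
\[ |\langle h_J, T h_I \rangle| \le \|K\|_{CZ_0} \int_J \int_I \frac{1}{|x-y|^d}\, dy\, dx\, \|h_J\|_\infty \|h_I\|_\infty \]
\[ \le \|K\|_{CZ_0} \int_J \int_{3J \setminus J} \frac{1}{|x-y|^d}\, dy\, dx\, |J|^{-1/2} |I|^{-1/2} \]
\[ \lesssim \|K\|_{CZ_0} |J| |J|^{-1/2} |I|^{-1/2} \lesssim \|K\|_{CZ_0}, \]
since $|I| \eqsim |J|$.

For $J = I$, let $I_i$ be its dyadic children. Then
\[ |\langle h_I, T h_I \rangle| \le \sum_{i,j=1}^{2^d} |\langle h_I \rangle_{I_i} \langle h_I \rangle_{I_j} \langle 1_{I_i}, T 1_{I_j} \rangle| \]
\[ \lesssim \frac{1}{|I|} \Big( \sum_{j \ne i} \|K\|_{CZ_0} \int_{I_i} \int_{I_j} \frac{1}{|x-y|^d}\, dy\, dx + \sum_i |\langle 1_{I_i}, T 1_{I_i} \rangle| \Big) \]
\[ \lesssim \|K\|_{CZ_0} + \|T\|_{WBP}, \]
by the same estimate as earlier for the first term, and the weak boundedness
property for the second. □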

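With this lemma, the sum $\sigma_{=}$ is recognized as a cancellative dyadic shift of type
$(0,0)$ as such:
\[ \sigma_{=} = \sum_{I \in \mathscr{D}} 1_{good}(I) \cdot \langle g, h_I \rangle \langle h_I, T h_I \rangle \langle h_I, f \rangle = (\|K\|_{CZ_0} + \|T\|_{WBP}) \langle g, S^{00} f \rangle, \]
where the factor in front could also be replaced by $\|T\|_{L^2 \to L^2}$.

For $I$ and $J$ participating in $\sigma_{near}$, we conclude from Lemma 3.7 that $K := I \vee J$
satisfies $\ell(K) \le 2^r \ell(I)$, and hence we may organize
\[ \sigma_{near} = \sum_{i=1}^r \sum_{j=1}^i \sum_K \sum_{\substack{I, J : I \vee J = K \\ \operatorname{dist}(I,J) \le \ell(J)\phi(\ell(I)/\ell(J)) \\ \ell(I) = 2^{-i}\ell(K) \\ \ell(J) = 2^{-j}\ell(K)}} 1_{good}(I) \cdot \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle, \]
and the innermost sum is recognized as $\|K\|_{CZ_0} \langle g, A_K^{ij} f \rangle$ for some cancellative
averaging operator of type $(i, j)$.

Summarizing, we have
\[ \sigma_{=} + \sigma_{near} = (\|K\|_{CZ_0} + \|T\|_{WBP}) \langle g, S^{00} f \rangle + \|K\|_{CZ_0} \sum_{j=1}^r \sum_{i=j}^r \langle g, S^{ij} f \rangle, \]
where $S^{00}$ and $S^{ij}$ are cancellative dyadic shifts, and the factor $(\|K\|_{CZ_0} + \|T\|_{WBP})$
could also be replaced by $\|T\|_{L^2 \to L^2}$.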
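3.D. Synthesis. We have checked that
\[ \sum_{\ell(I) \le \ell(J)} 1_{good}(I) \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle \]
\[ = (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) \Big( \sum_{1 \le j \le i < \infty} \phi(2^{-i})^{-d} \psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, S^{ij} f \rangle + \sum_{1 \le i < \infty} \Psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \langle g, S^{i0} f \rangle \Big) \]
\[ \qquad + (\|K\|_{CZ_0} + \|T\|_{WBP}) \langle g, S^{00} f \rangle + \langle g, \Pi_{T^*1}^* f \rangle, \]
where $\Psi(t) = \int_0^t \psi(s)\, ds/s$, $\Pi_{T^*1}^*$ is a paraproduct (a non-cancellative shift of type
$(0,0)$), and all other $S^{ij}$ are cancellative dyadic shifts of type $(i,j)$.

By symmetry (just observing that the cubes of equal size contributed precisely
to the presence of the cancellative shifts of type $(i,i)$, and that the dual of a shift
of type $(i,j)$ is a shift of type $(j,i)$), it follows that
\[ \sum_{\ell(I) > \ell(J)} 1_{good}(J) \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle \]
\[ = (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) \Big( \sum_{1 \le i < j < \infty} \phi(2^{-j})^{-d} \psi\big(2^{-j} \phi(2^{-j})^{-1}\big) \langle g, S^{ij} f \rangle + \sum_{1 \le j < \infty} \Psi\big(2^{-j} \phi(2^{-j})^{-1}\big) \langle g, S^{0j} f \rangle \Big) + \langle g, \Pi_{T1} f \rangle, \]
so that altogether
\[ \sum_{I, J} 1_{good}(\operatorname{smaller}\{I, J\}) \langle g, h_J \rangle \langle h_J, T h_I \rangle \langle h_I, f \rangle \]
\[ = (\|K\|_{CZ_0} + \|K\|_{CZ_\psi}) \Big( \sum_{i=1}^\infty \Psi\big(2^{-i} \phi(2^{-i})^{-1}\big) \big( \langle g, S^{i0} f \rangle + \langle g, S^{0i} f \rangle \big) \]
\[ \qquad + \sum_{i,j=1}^\infty \phi(2^{-\max(i,j)})^{-d} \psi\big(2^{-\max(i,j)} \phi(2^{-\max(i,j)})^{-1}\big) \langle g, S^{ij} f \rangle \Big) \]
\[ \qquad + (\|K\|_{CZ_0} + \|T\|_{WBP}) \langle g, S^{00} f \rangle + \langle g, \Pi_{T1} f \rangle + \langle g, \Pi_{T^*1}^* f \rangle, \]
and this completes the proof of Theorem 3.2.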

4. Two-weight theory for dyadic shifts 

Before proceeding further, it is convenient to introduce a useful trick due to
E. Sawyer. Let $\sigma$ be an everywhere positive, finitely-valued function. Then $f \in
L^p(w)$ if and only if $\phi = f/\sigma \in L^p(\sigma^p w)$, and they have equal norms in the
respective spaces. Hence an inequality
\[ (4.1) \qquad \|Tf\|_{L^p(w)} \le N \|f\|_{L^p(w)} \qquad \forall f \in L^p(w) \]
is equivalent to
\[ \|T(\phi\sigma)\|_{L^p(w)} \le N \|\phi\sigma\|_{L^p(w)} = N \|\phi\|_{L^p(\sigma^p w)} \qquad \forall \phi \in L^p(\sigma^p w). \]
This is true for any $\sigma$, and we now choose it in such a way that $\sigma^p w = \sigma$, i.e.,
$\sigma = w^{-1/(p-1)} = w^{1-p'}$, where $p'$ is the dual exponent. So finally (4.1) is equivalent
to
\[ \|T(\phi\sigma)\|_{L^p(w)} \le N \|\phi\|_{L^p(\sigma)} \qquad \forall \phi \in L^p(\sigma). \]
This formulation has the advantage that the norm on the right and the operator
\[ T(\phi\sigma)(x) = \int K(x,y) \phi(y) \cdot \sigma(y)\, dy \]
involve integration with respect to the same measure $\sigma$. In particular, the $A_2$
theorem is equivalent to
\[ \|T(f\sigma)\|_{L^2(w)} \le C_T [w]_{A_2} \|f\|_{L^2(\sigma)} \]
for all $f \in L^2(\sigma)$, for all $w \in A_2$ and $\sigma = w^{-1}$. But once we know this, we can also
study this two-weight inequality in its own right, for two general measures $w$ and
$\sigma$, which need not be related by the pointwise relation $\sigma(x) = 1/w(x)$.
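The algebra behind this change of weight is easy to test numerically. The sketch below replaces $\mathbb{R}^d$ by a finite set of points carrying positive weights, which is an assumption made purely for illustration, and checks the two identities used above: $\sigma^p w = \sigma$ for the dual weight $\sigma = w^{-1/(p-1)}$, and equality of the norm of $f$ in $L^p(w)$ with the norm of $f/\sigma$ in $L^p(\sigma)$. The function names are illustrative.

```python
import math

def lp_norm(values, weight, p):
    """Norm of a function on finitely many points in L^p of the given weight."""
    return sum(abs(v) ** p * m for v, m in zip(values, weight)) ** (1.0 / p)

def dual_weight(w, p):
    """The weight sigma = w^(-1/(p-1)) = w^(1-p') of Sawyer's trick."""
    return [x ** (-1.0 / (p - 1.0)) for x in w]

p = 3.0
w = [0.5, 2.0, 1.5, 4.0]          # an arbitrary positive weight
f = [1.0, -2.0, 0.25, 3.0]        # an arbitrary function

sigma = dual_weight(w, p)
phi = [a / s for a, s in zip(f, sigma)]

# sigma^p * w == sigma pointwise
weight_identity = all(math.isclose(s ** p * x, s) for s, x in zip(sigma, w))

# the norm of f in L^p(w) equals the norm of phi = f / sigma in L^p(sigma)
norm_f = lp_norm(f, w, p)
norm_phi = lp_norm(phi, sigma, p)
print(weight_identity, norm_f, norm_phi)
```

For $p = 2$ the dual weight reduces to $\sigma = 1/w$, the case relevant to the $A_2$ theorem.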



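4.2. Theorem. Let $\sigma$ and $w$ be two locally finite measures with
\[ [w, \sigma]_{A_2} := \sup_Q \frac{w(Q) \sigma(Q)}{|Q|^2} < \infty. \]
Then a dyadic shift $S$ of type $(i,j)$ satisfies $S(\sigma \cdot) : L^2(\sigma) \to L^2(w)$ if and only if
\[ \mathfrak{S} := \sup_Q \frac{\|1_Q S(\sigma 1_Q)\|_{L^2(w)}}{\sigma(Q)^{1/2}}, \qquad \mathfrak{S}^* := \sup_Q \frac{\|1_Q S^*(w 1_Q)\|_{L^2(\sigma)}}{w(Q)^{1/2}} \]
are finite, and in this case
\[ \|S(\sigma \cdot)\|_{L^2(\sigma) \to L^2(w)} \lesssim (1 + \kappa)(\mathfrak{S} + \mathfrak{S}^*) + (1 + \kappa)^2 [w, \sigma]_{A_2}^{1/2}, \]
where $\kappa = \max\{i, j\}$.

This result from my work with Pérez, Treil, and Volberg [10] was preceded by
an analogous qualitative version due to Nazarov, Treil, and Volberg [16].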

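The proof depends on decomposing functions in the spaces $L^2(w)$ and $L^2(\sigma)$ in
terms of expansions similar to the Haar expansion in $L^2(\mathbb{R}^d)$. Let $\mathbb{D}_I^\sigma$ be the
orthogonal projection of $L^2(\sigma)$ onto its subspace of functions supported on $I$, constant
on the subcubes of $I$, and with vanishing integral with respect to $d\sigma$. Then any
two $\mathbb{D}_I^\sigma$ are orthogonal to each other. Under the additional assumption that the $\sigma$
measure of quadrants of $\mathbb{R}^d$ is infinite, we have the expansion
\[ f = \sum_{Q \in \mathscr{D}} \mathbb{D}_Q^\sigma f \]
for all $f \in L^2(\sigma)$, and Pythagoras' theorem says that
\[ \|f\|_{L^2(\sigma)} = \Big( \sum_{Q \in \mathscr{D}} \|\mathbb{D}_Q^\sigma f\|_{L^2(\sigma)}^2 \Big)^{1/2}. \]
(These formulae need a slight adjustment if the $\sigma$ measure of quadrants is finite;
Theorem 4.2 remains true without this extra assumption.) Let us also write
\[ \mathbb{D}_K^{\sigma,i} := \sum_{\substack{I \subseteq K \\ \ell(I) = 2^{-i}\ell(K)}} \mathbb{D}_I^\sigma. \]
For a fixed $i \in \mathbb{N}$, these are also orthogonal to each other, and the above formulae
generalize to
\[ f = \sum_{Q \in \mathscr{D}} \mathbb{D}_Q^{\sigma,i} f, \qquad \|f\|_{L^2(\sigma)} = \Big( \sum_{Q \in \mathscr{D}} \|\mathbb{D}_Q^{\sigma,i} f\|_{L^2(\sigma)}^2 \Big)^{1/2}. \]

The proof is in fact very similar in spirit to that of Theorem 3.2: it is another $T1$
argument, but now with respect to the measures $\sigma$ and $w$ in place of the Lebesgue
measure. We hence expand
\[ \langle g, S(\sigma f) \rangle_w = \sum_{Q, R \in \mathscr{D}} \langle \mathbb{D}_R^w g, S(\sigma \mathbb{D}_Q^\sigma f) \rangle_w, \qquad f \in L^2(\sigma),\ g \in L^2(w), \]
and estimate the matrix coefficients
\[ (4.3) \qquad \langle \mathbb{D}_R^w g, S(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \sum_K \langle \mathbb{D}_R^w g, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \sum_K \sum_{I, J \subseteq K} a_{IJK} \langle \mathbb{D}_R^w g, h_J \rangle_w \langle h_I, \mathbb{D}_Q^\sigma f \rangle_\sigma. \]
For $\langle h_I, \mathbb{D}_Q^\sigma f \rangle_\sigma \ne 0$, there must hold $I \cap Q \ne \emptyset$, thus $I \subseteq Q$ or $Q \subsetneq I$. But in the
latter case $h_I$ is constant on $Q$, while $\int \mathbb{D}_Q^\sigma f \cdot \sigma = 0$, so the pairing vanishes even
in this case. Thus the only nonzero contributions come from $I \subseteq Q$, and similarly
from $J \subseteq R$. Since $I, J \subseteq K$, there holds
\[ (I \subseteq Q \subsetneq K \text{ or } K \subseteq Q) \quad \text{and} \quad (J \subseteq R \subsetneq K \text{ or } K \subseteq R). \]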

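4.A. Disjoint cubes. Suppose now that $Q \cap R = \emptyset$, and let $K$ be among those
cubes for which $A_K$ gives a nontrivial contribution above. Then it cannot be that
$K \subseteq Q$, since this would imply that $Q \cap R \supseteq K \cap J = J \ne \emptyset$, and similarly it
cannot be that $K \subseteq R$. Thus $Q, R \subsetneq K$, and hence
\[ Q \vee R \subseteq K. \]
Then
\[ |\langle \mathbb{D}_R^w g, S(\sigma \mathbb{D}_Q^\sigma f) \rangle_w| \le \sum_{K \supseteq Q \vee R} |\langle \mathbb{D}_R^w g, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w| \]
\[ \lesssim \sum_{K \supseteq Q \vee R} \frac{\|\mathbb{D}_R^w g\|_{L^1(w)} \|\mathbb{D}_Q^\sigma f\|_{L^1(\sigma)}}{|K|} \]
\[ \lesssim \frac{\|\mathbb{D}_R^w g\|_{L^1(w)} \|\mathbb{D}_Q^\sigma f\|_{L^1(\sigma)}}{|Q \vee R|}. \]
On the other hand, we have $Q \supseteq I$, $R \supseteq J$ for some $I, J \subseteq K$ with $\ell(I) = 2^{-i} \ell(K)$
and $\ell(J) = 2^{-j} \ell(K)$. Hence $2^{-i} \ell(K) \le \ell(Q)$ and $2^{-j} \ell(K) \le \ell(R)$, and thus
\[ Q \vee R \subseteq K \subseteq Q^{(i)} \cap R^{(j)}. \]
Now it is possible to estimate the total contribution of the part of the matrix
with $Q \cap R = \emptyset$. Let $P := Q \vee R$ be a new auxiliary summation variable. Then
$Q, R \subset P$, and $\ell(Q) = 2^{-a} \ell(P)$, $\ell(R) = 2^{-b} \ell(P)$ where $a = 1, \ldots, i$, $b = 1, \ldots, j$.
Thus
\[ \sum_{\substack{Q, R \in \mathscr{D} \\ Q \cap R = \emptyset}} |\langle \mathbb{D}_R^w g, S(\sigma \mathbb{D}_Q^\sigma f) \rangle_w| \]
\[ \lesssim \sum_{a=1}^i \sum_{b=1}^j \sum_{P \in \mathscr{D}} \frac{1}{|P|} \sum_{\substack{Q, R \in \mathscr{D} : Q \vee R = P \\ \ell(Q) = 2^{-a}\ell(P) \\ \ell(R) = 2^{-b}\ell(P)}} \|\mathbb{D}_R^w g\|_{L^1(w)} \|\mathbb{D}_Q^\sigma f\|_{L^1(\sigma)} \]
\[ \le \sum_{a,b=1}^{i,j} \sum_{P \in \mathscr{D}} \frac{1}{|P|} \sum_{\substack{R \subseteq P \\ \ell(R) = 2^{-b}\ell(P)}} \|\mathbb{D}_R^w g\|_{L^1(w)} \sum_{\substack{Q \subseteq P \\ \ell(Q) = 2^{-a}\ell(P)}} \|\mathbb{D}_Q^\sigma f\|_{L^1(\sigma)} \]
\[ = \sum_{a,b=1}^{i,j} \sum_{P \in \mathscr{D}} \frac{1}{|P|} \Big\| \sum_{\substack{R \subseteq P \\ \ell(R) = 2^{-b}\ell(P)}} \mathbb{D}_R^w g \Big\|_{L^1(w)} \Big\| \sum_{\substack{Q \subseteq P \\ \ell(Q) = 2^{-a}\ell(P)}} \mathbb{D}_Q^\sigma f \Big\|_{L^1(\sigma)} \qquad \text{(by disjoint supports)} \]
\[ = \sum_{a,b=1}^{i,j} \sum_{P \in \mathscr{D}} \frac{1}{|P|} \|\mathbb{D}_P^{w,b} g\|_{L^1(w)} \|\mathbb{D}_P^{\sigma,a} f\|_{L^1(\sigma)} \]
\[ \le \sum_{a,b=1}^{i,j} \sum_{P \in \mathscr{D}} \frac{\sigma(P)^{1/2} w(P)^{1/2}}{|P|} \|\mathbb{D}_P^{w,b} g\|_{L^2(w)} \|\mathbb{D}_P^{\sigma,a} f\|_{L^2(\sigma)} \]
\[ \le \sum_{a,b=1}^{i,j} [w, \sigma]_{A_2}^{1/2} \Big( \sum_{P \in \mathscr{D}} \|\mathbb{D}_P^{w,b} g\|_{L^2(w)}^2 \Big)^{1/2} \Big( \sum_{P \in \mathscr{D}} \|\mathbb{D}_P^{\sigma,a} f\|_{L^2(\sigma)}^2 \Big)^{1/2}. \]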



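4.B. Deeply contained cubes. Consider now the part of the sum with $Q \subset R$
and $\ell(Q) < 2^{-i} \ell(R)$. (The part with $R \subset Q$ and $\ell(R) < 2^{-j} \ell(Q)$ would be handled
in a symmetrical manner.)

4.4. Lemma. For all $Q \subset R$ with $\ell(Q) < 2^{-i} \ell(R)$, we have
\[ \langle \mathbb{D}_R^w g, S(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \langle \mathbb{D}_R^w g \rangle_{Q^{(i)}} \langle S^*(w 1_{Q^{(i)}}), \mathbb{D}_Q^\sigma f \rangle_\sigma, \]
where further
\[ \mathbb{D}_Q^\sigma S^*(w 1_{Q^{(i)}}) = \mathbb{D}_Q^\sigma S^*(w 1_P) \qquad \text{for any } P \supseteq Q^{(i)}. \]

Recall that $\mathbb{D}_Q^\sigma = (\mathbb{D}_Q^\sigma)^2 = (\mathbb{D}_Q^\sigma)^*$ is an orthogonal projection on $L^2(\sigma)$, so that
it can be moved to either or both sides of the pairing $\langle\ ,\ \rangle_\sigma$.

Proof. Recall formula (4.3). If $\langle h_I, \mathbb{D}_Q^\sigma f \rangle_\sigma$ is nonzero, then $I \subseteq Q$, and hence
\[ J \subseteq K = I^{(i)} \subseteq Q^{(i)} \subsetneq R \]
for all $J$ participating in the same $A_K$ as $I$. Thus $\mathbb{D}_R^w g$ is constant on $Q^{(i)}$, hence
\[ \langle \mathbb{D}_R^w g, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \langle 1_{Q^{(i)}} \mathbb{D}_R^w g, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w \]
\[ = \langle \mathbb{D}_R^w g \rangle_{Q^{(i)}} \langle 1_{Q^{(i)}}, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w \]
\[ = \langle \mathbb{D}_R^w g \rangle_{Q^{(i)}} \langle A_K^*(w 1_{Q^{(i)}}), \mathbb{D}_Q^\sigma f \rangle_\sigma. \]
Moreover, for any $P \supseteq Q^{(i)} \supseteq K$,
\[ \langle \mathbb{D}_Q^\sigma A_K^*(w 1_{Q^{(i)}}), f \rangle_\sigma = \langle 1_{Q^{(i)}}, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \int A_K(\sigma \mathbb{D}_Q^\sigma f)\, w \]
\[ = \langle 1_P, A_K(\sigma \mathbb{D}_Q^\sigma f) \rangle_w = \langle \mathbb{D}_Q^\sigma A_K^*(w 1_P), f \rangle_\sigma. \]
Summing these equalities over all relevant $K$, and using $S = \sum_K A_K$, gives the
claim. □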

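By the lemma, we can then manipulate

\[
\begin{aligned}
\sum_{\substack{Q,R:\,Q\subset R\\ \ell(Q)<2^{-i}\ell(R)}}\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_Q^\sigma f)\rangle_w
&= \sum_Q\Big(\sum_{R\supsetneq Q^{(i)}}\langle\mathbb{D}_R^w g\rangle_{Q^{(i)}}\Big)\langle S^*(w1_{Q^{(i)}}),\mathbb{D}_Q^\sigma f\rangle_\sigma \\
&= \sum_Q\langle g\rangle^w_{Q^{(i)}}\langle S^*(w1_{Q^{(i)}}),\mathbb{D}_Q^\sigma f\rangle_\sigma \\
&= \sum_R\langle g\rangle^w_R\Big\langle S^*(w1_R),\sum_{\substack{Q\subseteq R\\ \ell(Q)=2^{-i}\ell(R)}}\mathbb{D}_Q^\sigma f\Big\rangle_\sigma,
\end{aligned}
\]

where $\langle g\rangle^w_R := w(R)^{-1}\int_R g\cdot w$ is the average of $g$ on $R$ with respect to the $w$ measure.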

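By using the properties of the pairwise orthogonal projections $\mathbb{D}_R^{\sigma,i}$ on $L^2(\sigma)$, the above series may be estimated as follows:

\[
\begin{aligned}
\Big|\sum_{\substack{Q,R:\,Q\subset R\\ \ell(Q)<2^{-i}\ell(R)}}\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_Q^\sigma f)\rangle_w\Big|
&\le \sum_R|\langle g\rangle^w_R|\,\|\mathbb{D}_R^{\sigma,i}S^*(w1_R)\|_{L^2(\sigma)}\|\mathbb{D}_R^{\sigma,i}f\|_{L^2(\sigma)} \\
&\le \Big(\sum_R|\langle g\rangle^w_R|^2\|\mathbb{D}_R^{\sigma,i}S^*(w1_R)\|_{L^2(\sigma)}^2\Big)^{1/2}
\Big(\sum_R\|\mathbb{D}_R^{\sigma,i}f\|_{L^2(\sigma)}^2\Big)^{1/2},
\end{aligned}
\]

where the last factor is equal to $\|f\|_{L^2(\sigma)}$.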
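
The orthogonality used in the last step can be checked on a toy model. The following Python sketch is an illustration only, under assumptions that are not part of these lectures: the space is replaced by a finite grid of $2^m$ atoms, $\sigma$ is given by positive atom masses, and all function names are hypothetical. It verifies in exact arithmetic that the squared $L^2(\sigma)$ norms of the weighted martingale differences add up to the squared norm of $f$ minus its $\sigma$-average.

```python
from fractions import Fraction

def sigma_average(f, sigma, lo, hi):
    """Average of f over atoms lo..hi-1 with respect to the masses sigma."""
    mass = sum(sigma[lo:hi])
    return Fraction(sum(f[t] * sigma[t] for t in range(lo, hi)), mass)

def difference_energy(f, sigma):
    """Sum over all non-atomic dyadic intervals Q of ||D_Q^sigma f||^2 in L^2(sigma)."""
    n = len(f)
    total = Fraction(0)
    size = n
    while size >= 2:
        for lo in range(0, n, size):
            mid, hi = lo + size // 2, lo + size
            parent = sigma_average(f, sigma, lo, hi)
            for a, b in ((lo, mid), (mid, hi)):
                # D_Q^sigma f is constant on each child: child average minus parent average
                jump = sigma_average(f, sigma, a, b) - parent
                total += jump * jump * sum(sigma[a:b])
        size //= 2
    return total

def centered_energy(f, sigma):
    """||f - <f>^sigma||^2 in L^2(sigma), the average taken over the whole grid."""
    mean = sigma_average(f, sigma, 0, len(f))
    return sum((f[t] - mean) ** 2 * sigma[t] for t in range(len(f)))

f = [3, 1, 4, 1]
sigma = [1, 2, 1, 4]
print(difference_energy(f, sigma) == centered_energy(f, sigma))
```

Exact rational arithmetic is used so that the identity can be tested with equality rather than with a floating-point tolerance.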

The first factor on the right is handled by the dyadic Carleson embedding theorem: It follows from the second equality of Lemma 4.4, namely $\mathbb{D}_Q^\sigma S^*(w1_{Q^{(i)}})=\mathbb{D}_Q^\sigma S^*(w1_P)$ for all $P\supseteq Q^{(i)}$, that $\mathbb{D}_R^{\sigma,i}S^*(w1_R)=\mathbb{D}_R^{\sigma,i}S^*(w1_P)$ for all $P\supseteq R$. Hence, we have

\[
\sum_{R\subseteq P}\|\mathbb{D}_R^{\sigma,i}S^*(w1_R)\|_{L^2(\sigma)}^2
= \sum_{R\subseteq P}\|\mathbb{D}_R^{\sigma,i}(1_PS^*(w1_P))\|_{L^2(\sigma)}^2
\le \|1_PS^*(w1_P)\|_{L^2(\sigma)}^2
\le (\mathfrak{S}^*)^2\,w(P)
\]

by the (dual) testing estimate for the dyadic shifts. By the Carleson embedding theorem, it then follows that

\[
\Big(\sum_R|\langle g\rangle^w_R|^2\|\mathbb{D}_R^{\sigma,i}S^*(w1_R)\|_{L^2(\sigma)}^2\Big)^{1/2}
\lesssim \mathfrak{S}^*\|g\|_{L^2(w)},
\]

and the estimation of the deeply contained cubes is finished.

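4.C. Contained cubes of comparable size. It remains to estimate

\[ \sum_{\substack{Q,R:\,Q\subseteq R\\ \ell(Q)\ge2^{-i}\ell(R)}}\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_Q^\sigma f)\rangle_w; \]

the sum over $R\subsetneq Q$ with $\ell(R)\ge2^{-j}\ell(Q)$ would be handled in a symmetric manner. The sum of interest may be written as

\[
\sum_{a=0}^{i}\sum_R\Big\langle\mathbb{D}_R^w g, S\Big(\sigma\sum_{\substack{Q\subseteq R\\ \ell(Q)=2^{-a}\ell(R)}}\mathbb{D}_Q^\sigma f\Big)\Big\rangle_w
= \sum_{a=0}^{i}\sum_R\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_R^{\sigma,a}f)\rangle_w,
\]

and

\[ \mathbb{D}_R^w g = \sum_{k=1}^{2^d}\langle\mathbb{D}_R^w g\rangle_{R_k}1_{R_k}, \]

where the $R_k$ are the $2^d$ dyadic children of $R$, and $\langle\mathbb{D}_R^w g\rangle_{R_k}$ is the constant value of $\mathbb{D}_R^w g$ on $R_k$. Now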

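\[
\langle S^*(w1_{R_k}),\mathbb{D}_R^{\sigma,a}f\rangle_\sigma
= \langle 1_{R_k}S^*(w1_{R_k}),\mathbb{D}_R^{\sigma,a}f\rangle_\sigma
+ \langle S^*(w1_{R_k}),1_{R_k^c}\mathbb{D}_R^{\sigma,a}f\rangle_\sigma,
\]

where

\[ |\langle 1_{R_k}S^*(w1_{R_k}),\mathbb{D}_R^{\sigma,a}f\rangle_\sigma| \le \mathfrak{S}^*\,w(R_k)^{1/2}\|\mathbb{D}_R^{\sigma,a}f\|_{L^2(\sigma)} \]

and, observing that only those $A_K^*$ where $K$ intersects both $R_k$ and $R_k^c$ contribute to the second part,

\[
\begin{aligned}
|\langle S^*(w1_{R_k}),1_{R_k^c}\mathbb{D}_R^{\sigma,a}f\rangle_\sigma|
&= \Big|\sum_{K\supsetneq R_k}\langle A_K^*(w1_{R_k}),1_{R_k^c}\mathbb{D}_R^{\sigma,a}f\rangle_\sigma\Big| \\
&\le \sum_{K\supseteq R}\frac{1}{|K|}\,w(R_k)\|\mathbb{D}_R^{\sigma,a}f\|_{L^1(\sigma)} \\
&\lesssim [w,\sigma]_{A_2}^{1/2}\,w(R_k)^{1/2}\|\mathbb{D}_R^{\sigma,a}f\|_{L^2(\sigma)}.
\end{aligned}
\]

It follows that

\[ |\langle 1_{R_k}, S(\sigma\mathbb{D}_R^{\sigma,a}f)\rangle_w| \lesssim (\mathfrak{S}^*+[w,\sigma]_{A_2}^{1/2})\,w(R_k)^{1/2}\|\mathbb{D}_R^{\sigma,a}f\|_{L^2(\sigma)} \]

and hence

\[ |\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_R^{\sigma,a}f)\rangle_w| \lesssim (\mathfrak{S}^*+[w,\sigma]_{A_2}^{1/2})\|\mathbb{D}_R^w g\|_{L^2(w)}\|\mathbb{D}_R^{\sigma,a}f\|_{L^2(\sigma)}. \]

Finally,

\[
\begin{aligned}
\sum_{a=0}^{i}\sum_R|\langle\mathbb{D}_R^w g, S(\sigma\mathbb{D}_R^{\sigma,a}f)\rangle_w|
&\lesssim (\mathfrak{S}^*+[w,\sigma]_{A_2}^{1/2})\sum_{a=0}^{i}
\Big(\sum_R\|\mathbb{D}_R^w g\|_{L^2(w)}^2\Big)^{1/2}\Big(\sum_R\|\mathbb{D}_R^{\sigma,a}f\|_{L^2(\sigma)}^2\Big)^{1/2} \\
&\le (1+i)(\mathfrak{S}^*+[w,\sigma]_{A_2}^{1/2})\|g\|_{L^2(w)}\|f\|_{L^2(\sigma)}.
\end{aligned}
\]

The symmetric case with $R\subsetneq Q$ and $\ell(R)\ge2^{-j}\ell(Q)$ similarly yields the factor $(1+j)(\mathfrak{S}+[w,\sigma]_{A_2}^{1/2})$. This completes the proof of Theorem 4.2.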



5. Final decompositions: verification of the testing conditions 
We now turn to the estimation of the testing constant

\[ \mathfrak{S} := \sup_Q\frac{\|1_QS(\sigma1_Q)\|_{L^2(w)}}{\sigma(Q)^{1/2}}. \]

Bounding $\mathfrak{S}^*$ is analogous by exchanging the roles of $w$ and $\sigma$.
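5.A. Several splittings. First observe that

\[
1_QS(\sigma1_Q) = 1_Q\sum_{K:\,K\cap Q\neq\varnothing}A_K(\sigma1_Q)
= 1_Q\sum_{K\subseteq Q}A_K(\sigma1_Q) + 1_Q\sum_{K\supsetneq Q}A_K(\sigma1_Q).
\]

The second part is immediate to estimate even pointwise by

\[ \Big|1_Q\sum_{K\supsetneq Q}A_K(\sigma1_Q)\Big| \le 1_Q\sum_{K\supsetneq Q}\frac{\sigma(Q)}{|K|} \le 1_Q\frac{\sigma(Q)}{|Q|}, \]

and hence its $L^2(w)$ norm is bounded by

\[ \frac{\sigma(Q)}{|Q|}\,w(Q)^{1/2} \le [w,\sigma]_{A_2}^{1/2}\,\sigma(Q)^{1/2}. \]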

So it remains to concentrate on $K\subseteq Q$, and we perform several consecutive splittings of this collection of cubes. First, we separate scales by introducing the splitting according to the $\kappa+1$ possible values of $\log_2\ell(K)$ mod $(\kappa+1)$. We denote a generic choice of such a collection by

\[ \mathscr{K} = \mathscr{K}_k := \{K\subseteq Q : \log_2\ell(K)\equiv k \bmod (\kappa+1)\}, \]

where $k$ is arbitrary but fixed. (We will drop the subscript $k$, since its value plays no role in the subsequent argument.) Next, we freeze the $A_2$ characteristic by setting

\[ \mathscr{K}^a := \{K\in\mathscr{K} : 2^{a-1} < \langle w\rangle_K\langle\sigma\rangle_K \le 2^a\},\qquad a\in\mathbb{Z},\ a\le\lceil\log_2[w,\sigma]_{A_2}\rceil, \]

where $\lceil\ \rceil$ means rounding up to the next integer.

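In the next step, we choose the principal cubes $P\in\mathscr{P}^a\subseteq\mathscr{K}^a$. Let $\mathscr{P}^a_0$ consist of all maximal cubes in $\mathscr{K}^a$, and inductively $\mathscr{P}^a_{p+1}$ consist of all maximal $P'\in\mathscr{K}^a$ such that

\[ P'\subset P\ \text{for some } P\in\mathscr{P}^a_p,\qquad \langle\sigma\rangle_{P'} > 2\langle\sigma\rangle_P. \]

Finally, let $\mathscr{P}^a := \bigcup_{p=0}^{\infty}\mathscr{P}^a_p$. For each $K\in\mathscr{K}^a$, let $\Pi^a(K)$ denote the minimal $P\in\mathscr{P}^a$ such that $K\subseteq P$. Then we set

\[ \mathscr{K}^a(P) := \{K\in\mathscr{K}^a : \Pi^a(K)=P\},\qquad P\in\mathscr{P}^a. \]

Note that $\sigma(K)/|K|\le2\sigma(P)/|P|$ for all $K\in\mathscr{K}^a(P)$, which allows us to freeze the $\sigma$-to-Lebesgue measure ratio by the final subcollections

\[ \mathscr{K}^a_b(P) := \Big\{K\in\mathscr{K}^a(P) : 2^{-b} < \frac{\sigma(K)}{|K|}\cdot\frac{|P|}{\sigma(P)} \le 2^{1-b}\Big\},\qquad b\in\mathbb{N}. \]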

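We have

\[
\{K\in\mathscr{D} : K\subseteq Q\} = \bigcup_{k=0}^{\kappa}\mathscr{K}_k,\qquad
\mathscr{K}_k = \mathscr{K} = \bigcup_{a\le\lceil\log_2[w,\sigma]_{A_2}\rceil}\mathscr{K}^a,
\]

\[
\mathscr{K}^a = \bigcup_{P\in\mathscr{P}^a}\mathscr{K}^a(P),\qquad
\mathscr{K}^a(P) = \bigcup_{b=0}^{\infty}\mathscr{K}^a_b(P),
\]

where all unions are disjoint. Note that we drop the reference to the separation-of-scales parameter $k$, since this plays no role in the forthcoming arguments. Recalling the notation for subshifts $S_{\mathscr{Q}} = \sum_{K\in\mathscr{Q}}A_K$, this splitting of collections of cubes leads to the splitting of the function

\[ \sum_{K\subseteq Q}A_K(\sigma1_Q) = \sum_{k=0}^{\kappa}\sum_a\sum_{P\in\mathscr{P}^a}\sum_{b=0}^{\infty}S_{\mathscr{K}^a_b(P)}(\sigma1_Q). \]

On the level of the function, we split one more time to write

\[
S_{\mathscr{K}^a_b(P)}(\sigma1_Q) = \sum_{n=0}^{\infty}1_{E^a_b(P,n)}S_{\mathscr{K}^a_b(P)}(\sigma1_Q),
\]

\[
E^a_b(P,n) := \{x\in\mathbb{R}^d : n2^{-b}\langle\sigma\rangle_P < |S_{\mathscr{K}^a_b(P)}(\sigma1_Q)(x)| \le (n+1)2^{-b}\langle\sigma\rangle_P\}.
\]

This final splitting, from [8], is not strictly 'necessary' in that it was not part of the original argument in [5], nor its predecessor in [11], which made instead more careful use of the cubes where $S_{\mathscr{K}^a_b(P)}(\sigma1_Q)$ stays constant; however, it now seems that this splitting provides another simplification of the argument.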

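Now all relevant cancellation is inside the functions $S_{\mathscr{K}^a_b(P)}(\sigma1_Q)$, so that we can simply estimate by the triangle inequality:

\[
\Big|\sum_{K\subseteq Q}A_K(\sigma1_Q)\Big|
\le \sum_{k=0}^{\kappa}\sum_a\sum_{P\in\mathscr{P}^a}\sum_{b=0}^{\infty}\sum_{n=0}^{\infty}(1+n)2^{-b}\langle\sigma\rangle_P\,
1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}},
\]

and

\[
\Big\|\sum_{K\subseteq Q}A_K(\sigma1_Q)\Big\|_{L^2(w)}
\le \sum_{k=0}^{\kappa}\sum_a\sum_{b=0}^{\infty}2^{-b}\sum_{n=0}^{\infty}(1+n)
\Big\|\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}\Big\|_{L^2(w)}.
\]

Obviously, we will need good estimates to be able to sum up these infinite series. Write the last norm as

\[
\Big(\int\Big[\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}(x)\Big]^2w(x)\,dx\Big)^{1/2},
\]

observe that

\[ \{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)| > n2^{-b}\langle\sigma\rangle_P\}\subseteq P, \]

and look at the integrand at a fixed point $x\in\mathbb{R}^d$. At this point we sum over a subset of those values of $\langle\sigma\rangle_P$ where the principal cube $P\ni x$. Let $P_0$ be the smallest cube such that $|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)(x)|>n2^{-b}\langle\sigma\rangle_P$, let $P_1$ be the next smallest, and so on. Then $\langle\sigma\rangle_{P_m}<2^{-1}\langle\sigma\rangle_{P_{m-1}}<\ldots<2^{-m}\langle\sigma\rangle_{P_0}$ by the construction of the principal cubes, and hence

\[
\Big[\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}(x)\Big]^2
= \Big(\sum_{m=0}^{\infty}\langle\sigma\rangle_{P_m}\Big)^2
\le 4\langle\sigma\rangle_{P_0}^2
\le 4\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P^2\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}(x).
\]

Hence

\[
\Big\|\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}\Big\|_{L^2(w)}
\le 2\Big(\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P^2\,w\big(\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}\big)\Big)^{1/2},
\]

and it remains to obtain good estimates for the measure of the level sets

\[ \{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)| > n2^{-b}\langle\sigma\rangle_P\}. \]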

5.B. Weak-type and John–Nirenberg-style estimates. We still need to estimate the sets above. Recall that $S_{\mathscr{K}^a_b(P)}$ is a subshift of $S$, which in particular has its scales separated so that $\log_2\ell(K)\equiv k$ mod $(\kappa+1)$ for all $K$ for which $A_K$ participating in $S_{\mathscr{K}^a_b(P)}$ is nonzero, and $k\in\{0,1,\ldots,\kappa:=\max\{i,j\}\}$ is fixed, $S$ being of type $(i,j)$. The following estimate deals with such subshifts, which we simply denote by $S$.

5.1. Proposition. Let $S$ be a dyadic shift of type $(i,j)$ with scales separated. Then

\[ |\{|Sf|>\lambda\}| \le \frac{C}{\lambda}\|f\|_1,\qquad\forall\lambda>0, \]

where $C$ depends only on the dimension.

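Proof. The proof uses the classical Calderón–Zygmund decomposition:

\[ f = g+b,\qquad b := \sum_{L\in\mathscr{B}}b_L := \sum_{L\in\mathscr{B}}1_L(f-\langle f\rangle_L), \]

where $L\in\mathscr{B}$ are the maximal dyadic cubes with $\langle|f|\rangle_L>\lambda$; hence $\langle|f|\rangle_L\le2^d\lambda$.

As usual, $g=f-b$ satisfies $\|g\|_\infty\le2^d\lambda$ and $\|g\|_1\le\|f\|_1$, hence $\|g\|_2^2\le\|g\|_\infty\|g\|_1\le2^d\lambda\|f\|_1$, and thus

\[ |\{|Sg|>\tfrac12\lambda\}| \le \frac{4}{\lambda^2}\|Sg\|_2^2 \le \frac{4}{\lambda^2}\|g\|_2^2 \le 4\cdot2^d\frac{1}{\lambda}\|f\|_1. \]

It remains to estimate $|\{|Sb|>\tfrac12\lambda\}|$. First observe that

\[ Sb = \sum_{L\in\mathscr{B}}\sum_KA_Kb_L = \sum_{L\in\mathscr{B}}\Big(\sum_{K\subseteq L}A_Kb_L + \sum_{K\supsetneq L}A_Kb_L\Big), \]

since $A_Kb_L\neq0$ only if $K\cap L\neq\varnothing$. Now

\[
\begin{aligned}
|\{|Sb|>\tfrac12\lambda\}|
&\le \Big|\Big\{\Big|\sum_{L\in\mathscr{B}}\sum_{K\subseteq L}A_Kb_L\Big|>0\Big\}\Big|
+ \Big|\Big\{\Big|\sum_{L\in\mathscr{B}}\sum_{K\supsetneq L}A_Kb_L\Big|>\tfrac12\lambda\Big\}\Big| \\
&\le \sum_{L\in\mathscr{B}}|L| + \frac{2}{\lambda}\Big\|\sum_{L\in\mathscr{B}}\sum_{K\supsetneq L}A_Kb_L\Big\|_1 \\
&\le \frac{1}{\lambda}\|f\|_1 + \frac{2}{\lambda}\sum_{L\in\mathscr{B}}\sum_{K\supsetneq L}\|A_Kb_L\|_1,
\end{aligned}
\]

where we used the elementary properties of the Calderón–Zygmund decomposition to estimate the first term.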

For the remaining double sum, we still need some observations. Recall that

\[ A_Kb_L = \sum_{\substack{I,J\subseteq K\\ \ell(I)=2^{-i}\ell(K)\\ \ell(J)=2^{-j}\ell(K)}}a_{IJK}\,h_I\langle h_J,b_L\rangle. \]

Now, if $\ell(K)>2^{\kappa}\ell(L)\ge2^{j}\ell(L)$, then $\ell(J)>\ell(L)$, and hence $h_J$ is constant on $L$. But the integral of $b_L$ vanishes, hence $\langle h_J,b_L\rangle=0$ for all relevant $J$, and thus $A_Kb_L=0$ whenever $\ell(K)>2^{\kappa}\ell(L)$.

Thus, in the inner sum, the only possible nonzero terms are $A_Kb_L$ for $K=L^{(m)}$ with $m=1,\ldots,\kappa$. By the separation of scales, at most one of these terms is nonzero, and we write $\tilde L$ for the corresponding unique $K$. So in fact

\[
\frac{2}{\lambda}\sum_{L\in\mathscr{B}}\sum_{K\supsetneq L}\|A_Kb_L\|_1
= \frac{2}{\lambda}\sum_{L\in\mathscr{B}}\|A_{\tilde L}b_L\|_1
\le \frac{2}{\lambda}\sum_{L\in\mathscr{B}}\|b_L\|_1
\le \frac{2}{\lambda}\cdot2\|f\|_1 = \frac{4}{\lambda}\|f\|_1
\]

by using the normalized boundedness of the averaging operators $A_K$ on $L^1(\mathbb{R}^d)$, and an elementary estimate for the bad part of the Calderón–Zygmund decomposition. Altogether, we obtain the claim with $C=4\cdot2^d+5$. □
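
The decomposition used in this proof can be tried out on a toy example. The sketch below is an illustration only, under assumptions that are not part of the lectures: dimension $d=1$, the unit interval split into $2^m$ equal atoms on which $f$ is constant, and an average of $|f|$ over the whole interval that does not exceed $\lambda$. The function names are hypothetical. It selects the maximal dyadic intervals $L$ with $\langle|f|\rangle_L>\lambda$ and forms $g$ and $b$ as in the proof.

```python
from fractions import Fraction

def cz_decompose(f, lam):
    """Dyadic Calderon-Zygmund decomposition on [0,1) at height lam (d = 1).

    f: list of 2**m values, one per atom of length 2**-m.
    Returns (cubes, g, b) with cubes given as (lo, hi) atom index ranges.
    """
    f = [Fraction(v) for v in f]
    lam = Fraction(lam)
    n = len(f)
    assert n > 0 and n & (n - 1) == 0, "need 2**m atoms"
    assert sum(abs(v) for v in f) / n <= lam, "top average must not exceed lam"
    cubes = []

    def search(lo, hi):
        # top-down search: the first interval on a branch with large average is maximal
        if sum(abs(v) for v in f[lo:hi]) / (hi - lo) > lam:
            cubes.append((lo, hi))
        elif hi - lo > 1:
            mid = (lo + hi) // 2
            search(lo, mid)
            search(mid, hi)

    search(0, n)
    b = [Fraction(0)] * n
    for lo, hi in cubes:
        mean = sum(f[lo:hi]) / (hi - lo)
        for t in range(lo, hi):
            b[t] = f[t] - mean          # b_L = 1_L (f - <f>_L)
    g = [f[t] - b[t] for t in range(n)]
    return cubes, g, b

def l1_norm(h):
    """L^1 norm on [0,1) of a function that is constant on equal atoms."""
    return sum(abs(v) for v in h) / len(h)

f = [0, 0, 0, 8, 1, 0, 0, 0]
cubes, g, b = cz_decompose(f, 2)
print(cubes, max(abs(v) for v in g), l1_norm(b))
```

In this example the only selected interval consists of the atoms with indices 2 and 3, the good part is bounded by $2^d\lambda=4$, and the bad part has vanishing integral over the selected interval.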

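For the special subshifts $S_{\mathscr{K}^a_b(P)}$, we can improve the weak-type (1,1) estimate to an exponential decay:

5.2. Proposition. Let $S_{\mathscr{K}^a_b(P)}$ be the subshift of $S$ as constructed earlier. Then the following estimate holds when $\nu$ is either the Lebesgue measure or $w$:

\[ \nu\big(\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)| > C2^{-b}\langle\sigma\rangle_P\cdot t\}\big) \le C2^{-t}\nu(P),\qquad t\ge0, \]

where $C$ is a constant.

Proof. Let $\lambda := C2^{-b}\langle\sigma\rangle_P$, where $C$ is a large constant, and $n\in\mathbb{Z}_+$. Let $x\in\mathbb{R}^d$ be a point where

\[ |S_{\mathscr{K}^a_b(P)}(\sigma1_Q)(x)| > n\lambda. \tag{5.3} \]

Then for all small enough $L\in\mathscr{K}^a_b(P)$ with $L\ni x$, there holds

\[ \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\supseteq L}}A_K(\sigma1_Q)(x)\Big| > n\lambda. \]

Since $\sum_{K\in\mathscr{K}^a_b(P),\,K\supsetneq L}A_K(\sigma1_Q)$ is constant on $L$ (thanks to separation of scales), and

\[ \|A_L(\sigma1_Q)\|_\infty \le \frac{\sigma(L)}{|L|} \le 2^{1-b}\langle\sigma\rangle_P, \tag{5.4} \]

which is at most $\tfrac13\lambda$ once $C\ge6$, it follows that

\[ \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\supsetneq L}}A_K(\sigma1_Q)\Big| > (n-\tfrac23)\lambda\qquad\text{on } L. \tag{5.5} \]
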

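Let $\mathscr{L}\subseteq\mathscr{K}^a_b(P)$ be the collection of maximal cubes with the above property. Thus all $L\in\mathscr{L}$ are disjoint, and all $x$ with (5.3) belong to some $L$. By maximality of $L$, the minimal $L^*\in\mathscr{K}^a_b(P)$ with $L^*\supsetneq L$ satisfies

\[ \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\supsetneq L^*}}A_K(\sigma1_Q)\Big| \le (n-\tfrac23)\lambda\qquad\text{on } L^*. \]

By an estimate similar to (5.4), with $L^*$ in place of $L$, it follows that

\[ \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\supsetneq L}}A_K(\sigma1_Q)\Big| \le (n-\tfrac13)\lambda\qquad\text{on } L. \]

Thus, if $x$ satisfies (5.3) and $x\in L\in\mathscr{L}$, then necessarily

\[ |S_{\{K\in\mathscr{K}^a_b(P):\,K\subseteq L\}}(\sigma1_Q)(x)| = \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\subseteq L}}A_K(\sigma1_Q)(x)\Big| > \tfrac13\lambda. \]
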
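Using the weak-type estimate to the shift $S_{\{K\in\mathscr{K}^a_b(P):\,K\subseteq L\}}$ of type $(i,j)$ with scales separated, noting that $A_K(\sigma1_Q)=A_K(\sigma1_L)$ for $K\subseteq L$, it follows that

\[
\Big|\Big\{\Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\subseteq L}}A_K(\sigma1_Q)\Big| > \tfrac13\lambda\Big\}\Big|
\le \frac{C'}{\lambda}\sigma(L)
= \frac{C'}{C}\,2^b\,\frac{\sigma(L)}{\langle\sigma\rangle_P|L|}\,|L|
\le \tfrac13|L|,
\]

provided that the constant $C$ in the definition of $\lambda$ was chosen large enough; here $C'$ is a fixed multiple of the constant from Proposition 5.1. Recalling (5.5), there holds

\[
\Big|\sum_{K\in\mathscr{K}^a_b(P)}A_K(\sigma1_Q)\Big|
\ge \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\supsetneq L}}A_K(\sigma1_Q)\Big|
- \Big|\sum_{\substack{K\in\mathscr{K}^a_b(P)\\ K\subseteq L}}A_K(\sigma1_Q)\Big|
> (n-\tfrac23)\lambda - \tfrac13\lambda = (n-1)\lambda
\]

on a set $\tilde L\subseteq L$ with $|\tilde L|\ge\tfrac23|L|$. Thus

\[
\begin{aligned}
|\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n\lambda\}|
&\le \sum_{L\in\mathscr{L}}|L\cap\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n\lambda\}| \\
&\le \sum_{L\in\mathscr{L}}|\{|S_{\{K\in\mathscr{K}^a_b(P):\,K\subseteq L\}}(\sigma1_Q)|>\tfrac13\lambda\}| \\
&\le \sum_{L\in\mathscr{L}}\tfrac13|L| \le \tfrac12\sum_{L\in\mathscr{L}}|\tilde L| \\
&\le \tfrac12\sum_{L\in\mathscr{L}}|L\cap\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>(n-1)\lambda\}| \\
&\le \tfrac12|\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>(n-1)\lambda\}|.
\end{aligned}
\]

By induction it follows that

\[
|\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n\lambda\}|
\le 2^{-n}|\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>0\}|
\le 2^{-n}\sum_{M\in\mathscr{M}}|M| \le 2^{-n}|P|,
\]

where $\mathscr{M}$ is the collection of maximal cubes in $\mathscr{K}^a_b(P)$.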

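To deduce the corresponding estimate for the $w$ measure, selected intermediate steps of the above computation, as well as the definition of $\mathscr{K}^a_b(P)$, will be exploited.

Recall that $K\in\mathscr{K}^a$ means that $2^{a-1}<\langle w\rangle_K\langle\sigma\rangle_K\le2^a$, while $K\in\mathscr{K}^a_b(P)$ means that in addition $2^{-b}<\langle\sigma\rangle_K/\langle\sigma\rangle_P\le2^{1-b}$. Put together, this says that

\[ 2^{a+b-2}\frac{|K|}{\langle\sigma\rangle_P} < w(K) < 2^{a+b}\frac{|K|}{\langle\sigma\rangle_P}. \]

Hence, using the collections $\mathscr{L},\mathscr{M}\subseteq\mathscr{K}^a_b(P)$ as above,

\[
\begin{aligned}
w(\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n\lambda\})
&\le \sum_{L\in\mathscr{L}}w(L) \le \sum_{L\in\mathscr{L}}\frac{2^{a+b}}{\langle\sigma\rangle_P}|L| \\
&\lesssim \frac{2^{a+b}}{\langle\sigma\rangle_P}|\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>(n-1)\lambda\}| \\
&\lesssim \frac{2^{a+b}}{\langle\sigma\rangle_P}\cdot2^{-n}\sum_{M\in\mathscr{M}}|M| \\
&\le 4\cdot2^{-n}\sum_{M\in\mathscr{M}}w(M) \le 4\cdot2^{-n}w(P). \qquad\Box
\end{aligned}
\]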



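5.C. Conclusion of the estimation of the testing conditions. Recall that

\[
\Big\|\sum_{K\subseteq Q}A_K(\sigma1_Q)\Big\|_{L^2(w)}
\le \sum_{k=0}^{\kappa}\sum_a\sum_{b=0}^{\infty}2^{-b}\sum_{n=0}^{\infty}(1+n)
\Big\|\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}\Big\|_{L^2(w)}
\]

and

\[
\begin{aligned}
\Big\|\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}\Big\|_{L^2(w)}
&\le 2\Big(\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P^2\,w\big(\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}\big)\Big)^{1/2} \\
&\le C\Big(\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P^2\,2^{-cn}w(P)\Big)^{1/2} \\
&\le C2^{-cn/2}\Big(2^a\sum_{P\in\mathscr{P}^a}\sigma(P)\Big)^{1/2},
\end{aligned}
\]

where $c>0$ comes from the exponential decay in Proposition 5.2, recalling the freezing of the $A_2$ characteristic between $2^{a-1}$ and $2^a$ for cubes in $\mathscr{K}^a$.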
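For the summation over the principal cubes, we observe that

\[ \sum_{P\in\mathscr{P}^a}\sigma(P) = \sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P|P| = \int_Q\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P1_P(x)\,dx. \]

At any given $x$, if $P_0\subsetneq P_1\subsetneq\ldots\subseteq Q$ are the principal cubes containing it, we have

\[
\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P1_P(x) = \sum_{m=0}^{\infty}\langle\sigma\rangle_{P_m}
\le \sum_{m=0}^{\infty}2^{-m}\langle\sigma\rangle_{P_0} = 2\langle\sigma\rangle_{P_0} \le 2M(\sigma1_Q)(x),
\]

where $M$ is the dyadic maximal operator. Hence

\[ \sum_{P\in\mathscr{P}^a}\sigma(P) \le 2\int_QM(\sigma1_Q)\,dx \le 2[\sigma]_{A_\infty}\sigma(Q) \]

by the definition of the $A_\infty$ characteristic

\[ [\sigma]_{A_\infty} := \sup_Q\frac{1}{\sigma(Q)}\int_QM(\sigma1_Q)\,dx. \]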
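
The bound on the sum over principal cubes can be observed numerically. The following sketch is an illustration only, under simplifying assumptions that are not in the lectures: dimension one, $Q=[0,1)$ split into $2^m$ equal atoms on which the density of $\sigma$ is constant, and the stopping rule applied to all dyadic subintervals of $Q$ (ignoring the freezing parameters $a$ and $k$). The function names are hypothetical.

```python
from fractions import Fraction

def average(sigma, lo, hi):
    """Lebesgue average of the density sigma over the atoms lo..hi-1."""
    return Fraction(sum(sigma[lo:hi]), hi - lo)

def principal_intervals(sigma):
    """Stopping intervals: Q itself, then maximal P' inside P with <sigma>_{P'} > 2 <sigma>_P."""
    n = len(sigma)
    principal = [(0, n)]

    def descend(lo, hi, ref):
        # ref is the average of sigma over the current principal ancestor
        if hi - lo == 1:
            return
        mid = (lo + hi) // 2
        for a, b in ((lo, mid), (mid, hi)):
            avg = average(sigma, a, b)
            if avg > 2 * ref:
                principal.append((a, b))
                descend(a, b, avg)
            else:
                descend(a, b, ref)

    descend(0, n, average(sigma, 0, n))
    return principal

def dyadic_maximal(sigma):
    """Dyadic maximal function of sigma 1_Q, one value per atom."""
    n = len(sigma)
    maximal = [Fraction(0)] * n
    size = n
    while size >= 1:
        for lo in range(0, n, size):
            avg = average(sigma, lo, lo + size)
            for t in range(lo, lo + size):
                maximal[t] = max(maximal[t], avg)
        size //= 2
    return maximal

sigma = [1, 1, 1, 1, 1, 1, 1, 9]
P = principal_intervals(sigma)
# both sides below are multiplied by the number of atoms, so atom length cancels
lhs = sum(sum(sigma[lo:hi]) for lo, hi in P)   # sum of sigma(P) over principal P
rhs = 2 * sum(dyadic_maximal(sigma))           # 2 * integral of M(sigma 1_Q)
print(P, lhs, rhs)
```

For this density the principal intervals are $Q$ itself and the last quarter-length... more precisely the interval of atoms 6 and 7, and the sum of $\sigma(P)$ stays below twice the integral of the maximal function, as the argument above predicts.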
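Substituting back, we have

\[
\begin{aligned}
\Big\|\sum_{K\subseteq Q}A_K(\sigma1_Q)\Big\|_{L^2(w)}
&\le \sum_{k=0}^{\kappa}\sum_a\sum_{b=0}^{\infty}2^{-b}\sum_{n=0}^{\infty}(1+n)
\Big\|\sum_{P\in\mathscr{P}^a}\langle\sigma\rangle_P\,1_{\{|S_{\mathscr{K}^a_b(P)}(\sigma1_Q)|>n2^{-b}\langle\sigma\rangle_P\}}\Big\|_{L^2(w)} \\
&\le \sum_{k=0}^{\kappa}\sum_a\sum_{b=0}^{\infty}2^{-b}\sum_{n=0}^{\infty}(1+n)\cdot C2^{-cn}\Big(2^a\sum_{P\in\mathscr{P}^a}\sigma(P)\Big)^{1/2} \\
&\le \sum_{k=0}^{\kappa}\sum_a\sum_{b=0}^{\infty}2^{-b}\sum_{n=0}^{\infty}(1+n)\cdot C2^{-cn}\big(2^a[\sigma]_{A_\infty}\sigma(Q)\big)^{1/2} \\
&= C\cdot[\sigma]_{A_\infty}^{1/2}\sigma(Q)^{1/2}\sum_{k=0}^{\kappa}
\Big(\sum_{a\le\lceil\log_2[w,\sigma]_{A_2}\rceil}2^{a/2}\Big)\Big(\sum_{b=0}^{\infty}2^{-b}\Big)\Big(\sum_{n=0}^{\infty}(1+n)2^{-cn}\Big) \\
&\le C\,\sigma(Q)^{1/2}[\sigma]_{A_\infty}^{1/2}\cdot(1+\kappa)\cdot[w,\sigma]_{A_2}^{1/2},
\end{aligned}
\]

where $c>0$ and $C$ are constants whose values may change from line to line, and thus the testing constant $\mathfrak{S}$ is estimated by

\[ \mathfrak{S} \le C\cdot(1+\kappa)\cdot[w,\sigma]_{A_2}^{1/2}[\sigma]_{A_\infty}^{1/2}. \]

By symmetry, exchanging the roles of $w$ and $\sigma$, we also have the analogous result for $\mathfrak{S}^*$, and so we have completed the proof of the following: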

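5.6. Theorem. Let $\sigma,w\in A_\infty$ be functions which satisfy the joint $A_2$ condition

\[ [w,\sigma]_{A_2} := \sup_Q\frac{w(Q)\sigma(Q)}{|Q|^2} < \infty. \]

Then the testing constants $\mathfrak{S}$ and $\mathfrak{S}^*$ associated with a dyadic shift $S$ of type $(i,j)$ satisfy the following bounds, where $\kappa:=\max\{i,j\}$:

\[ \mathfrak{S} \le C\cdot(1+\kappa)\cdot[w,\sigma]_{A_2}^{1/2}[\sigma]_{A_\infty}^{1/2}, \]

\[ \mathfrak{S}^* \le C\cdot(1+\kappa)\cdot[w,\sigma]_{A_2}^{1/2}[w]_{A_\infty}^{1/2}. \]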

6. Conclusions 

In this section we simply collect the fruits of the hard work done above. A combination of Theorems 4.2 and 5.6 gives the following two-weight inequality, whose qualitative version was pointed out by Lacey, Petermichl and Reguera:

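6.1. Theorem. Let $\sigma,w\in A_\infty$ be functions which satisfy the joint $A_2$ condition

\[ [w,\sigma]_{A_2} := \sup_Q\frac{w(Q)\sigma(Q)}{|Q|^2} < \infty. \]

Then a dyadic shift $S$ of type $(i,j)$ satisfies $S(\sigma\,\cdot):L^2(\sigma)\to L^2(w)$, and more precisely

\[ \|S(\sigma\,\cdot)\|_{L^2(\sigma)\to L^2(w)} \lesssim (1+\kappa)^2[w,\sigma]_{A_2}^{1/2}\big([w]_{A_\infty}^{1/2}+[\sigma]_{A_\infty}^{1/2}\big), \]

where $\kappa=\max\{i,j\}$.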
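
To get a feeling for the joint $A_2$ characteristic appearing in this bound, one can evaluate a dyadic version of it on a toy example. The sketch below is an illustration only, with assumptions that are not in the lectures: dimension one, densities that are constant on $2^m$ equal atoms of $[0,1)$, and a supremum taken only over the dyadic subintervals of this grid (which can only underestimate the true supremum over all cubes). The function name is hypothetical.

```python
from fractions import Fraction

def dyadic_joint_a2(w, sigma):
    """Maximum of <w>_I <sigma>_I over the dyadic subintervals I of the grid."""
    n = len(w)
    assert n == len(sigma) and n > 0 and n & (n - 1) == 0
    best = Fraction(0)
    size = n
    while size >= 1:
        for lo in range(0, n, size):
            # averages of piecewise constant densities over equal atoms
            avg_w = Fraction(sum(w[lo:lo + size])) / size
            avg_sigma = Fraction(sum(sigma[lo:lo + size])) / size
            best = max(best, avg_w * avg_sigma)
        size //= 2
    return best

# one-weight situation: sigma is the dual weight 1/w
w = [Fraction(1), Fraction(4)]
sigma = [1 / v for v in w]
print(dyadic_joint_a2(w, sigma))
```

With the dual weight $\sigma=w^{-1}$ the product of averages is at least one on every interval by the Cauchy–Schwarz inequality, which is why the characteristic measures how far $w$ is from being constant.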

The quantitative bound as stated, including the polynomial dependence on $\kappa$, allows us to sum up these estimates in the Dyadic Representation Theorem to deduce:

6.2. Theorem. Let $\sigma,w\in A_\infty$ be functions which satisfy the joint $A_2$ condition. Then any $L^2$ bounded Calderón–Zygmund operator $T$ whose kernel $K$ has Hölder type modulus of continuity $\psi(t)=t^\alpha$, $\alpha\in(0,1)$, satisfies

\[ \|T(\sigma\,\cdot)\|_{L^2(\sigma)\to L^2(w)} \lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w,\sigma]_{A_2}^{1/2}\big([w]_{A_\infty}^{1/2}+[\sigma]_{A_\infty}^{1/2}\big). \]

Recalling the dual weight trick and specializing to the one-weight situation with $\sigma=w^{-1}$, this in turn gives:

6.3. Theorem. Let $w\in A_2$. Then any $L^2$ bounded Calderón–Zygmund operator $T$ whose kernel $K$ has Hölder type modulus of continuity $\psi(t)=t^\alpha$, $\alpha\in(0,1)$, satisfies

\[
\begin{aligned}
\|T\|_{L^2(w)\to L^2(w)}
&\lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_2}^{1/2}\big([w]_{A_\infty}^{1/2}+[w^{-1}]_{A_\infty}^{1/2}\big) \\
&\lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_2}.
\end{aligned}
\]

The second displayed line is the original $A_2$ theorem [6], and it follows from the first line by $[w]_{A_\infty}\lesssim[w]_{A_2}$ and $[w^{-1}]_{A_\infty}\lesssim[w^{-1}]_{A_2}=[w]_{A_2}$. Its strengthening on the first line was first observed in my joint work with C. Pérez [5]. Note that, compared to the introductory statement in Theorem 1.1, the dependence on the operator $T$ has been made more explicit. (The implied constants in the notation "$\lesssim$" only depend on the dimension and the Hölder exponent $\alpha$.) This dependence on $\|T\|_{L^2\to L^2}$ and $\|K\|_{CZ_\alpha}$ is implicit in the original proof, but has not been spelled out before.

7. Further results and remarks 

This final section briefly collects some related developments, which were not covered in the actual lectures.

The $A_2$ theorem implies a corresponding $A_p$ theorem for all $p\in(1,\infty)$. This follows from the sharp weighted extrapolation theorem of Dragičević, Grafakos, Pereyra, and Petermichl [3], which was known well before the proof of the full $A_2$ theorem:

7.1. Theorem. If an operator $T$ satisfies

\[ \|T\|_{L^2(w)\to L^2(w)} \le C_T[w]_{A_2}^{\tau} \]

for all $w\in A_2$, then it satisfies

\[ \|T\|_{L^p(w)\to L^p(w)} \le c_pC_T[w]_{A_p}^{\tau\max\{1,1/(p-1)\}} \]

for all $p\in(1,\infty)$ and $w\in A_p$.

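7.2. Corollary. Let $p\in(1,\infty)$ and $w\in A_p$. Then any $L^2$ bounded Calderón–Zygmund operator $T$ whose kernel $K$ has Hölder type modulus of continuity $\psi(t)=t^\alpha$, $\alpha\in(0,1)$, satisfies

\[ \|T\|_{L^p(w)\to L^p(w)} \lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_p}^{\max\{1,1/(p-1)\}}. \]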
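
The exponent produced by the extrapolation theorem behaves differently on the two sides of $p=2$. The small sketch below only evaluates the formula $\tau\max\{1,1/(p-1)\}$ from Theorem 7.1; the function name and the default $\tau=1$ (the linear $A_2$ bound) are choices made here for illustration.

```python
def extrapolated_exponent(p, tau=1.0):
    """Power of [w]_{A_p} given by sharp extrapolation from an A_2 bound with power tau."""
    if p <= 1:
        raise ValueError("p must lie in (1, infinity)")
    return tau * max(1.0, 1.0 / (p - 1.0))

for p in (1.5, 2.0, 3.0):
    print(p, extrapolated_exponent(p))
```

For $p\ge2$ the power stays equal to $\tau$, while for $p<2$ it grows like $1/(p-1)$ as $p$ approaches $1$.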

It is also possible to apply a version of the extrapolation argument to the mixed $A_2/A_\infty$ bounds [9], but this did not give the optimal results for $p\neq2$. However, by setting up a different argument directly in $L^p(w)$, the following bounds were obtained in my collaboration with M. Lacey [7]:

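7.3. Theorem. Let $p\in(1,\infty)$ and $w\in A_p$. Then any $L^2$ bounded Calderón–Zygmund operator $T$ whose kernel $K$ has Hölder type modulus of continuity $\psi(t)=t^\alpha$, $\alpha\in(0,1)$, satisfies

\[ \|T\|_{L^p(w)\to L^p(w)} \lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_p}^{1/p}\big([w]_{A_\infty}^{1/p'}+[w^{1-p'}]_{A_\infty}^{1/p}\big). \]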

For weak-type bounds, which were investigated by Lacey, Martikainen, Orponen, Reguera, Sawyer, Uriarte-Tuero, and myself [8], we need only 'half' of the strong-type upper bound:

7.4. Theorem. Let $p\in(1,\infty)$ and $w\in A_p$. Then any $L^2$ bounded Calderón–Zygmund operator $T$ whose kernel $K$ has Hölder type modulus of continuity $\psi(t)=t^\alpha$, $\alpha\in(0,1)$, satisfies

\[
\begin{aligned}
\|T\|_{L^p(w)\to L^{p,\infty}(w)}
&\lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_p}^{1/p}[w]_{A_\infty}^{1/p'} \\
&\lesssim \big(\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}\big)[w]_{A_p}.
\end{aligned}
\]

All these results remain valid for the maximal truncations

\[ T_\natural f(x) := \sup_{\varepsilon>0}|T_\varepsilon f(x)|,\qquad T_\varepsilon f(x) := \int_{|x-y|>\varepsilon}K(x,y)f(y)\,dy, \]

which have been addressed in [7], [8]. In [8] it was also shown that the sharp weighted bounds for dyadic shifts can be made linear (instead of quadratic) in $\kappa$, a result recovered by a different (Bellman function) method by Treil [20]. Earlier polynomial-in-$\kappa$ Bellman function estimates for the shifts were due to Nazarov and Volberg [17]. An extension of the $A_2$ theorem to metric spaces with a doubling measure (spaces of homogeneous type) is due to Nazarov, Reznikov, and Volberg [14].

It seems that the strategy developed in [7] is the most efficient one for dealing with both the maximal truncations and the mixed $A_p/A_\infty$ bounds. It consists of the following steps:

• Reduction of Calderon-Zygmund operators to dyadic shift operators by the 
Representation Theorem, just like here. 

• Reduction of the dyadic shifts to positive dyadic shifts by an ingenious formula of A. Lerner [13], which provides very precise and useful information of a function in terms of its 'local oscillations': the local oscillations of $Sf$ (and even $S_\natural f$) can be estimated in terms of $f$ with the help of the weak-type (1,1) inequality for $S$ (and $S_\natural$).

• Reduction of the estimates for positive shifts to testing conditions. 

• Verification of the testing conditions for the positive shifts. 

Notice that these steps are essentially the same as the ones followed in these lectures, 
except for the additional second step. After this further reduction, the last two 
steps are even slightly easier, since they are only needed for positive operators. 
In this setting, the orthogonality arguments, which were decisive for the present reduction to testing conditions, are replaced by positive-kernel estimates, for which appropriate theory valid in all $L^p$, not just $p=2$, has been developed by Lacey, Sawyer, and Uriarte-Tuero [12].

7.A. The Ahlfors–Beurling operator and its powers. One of the key original motivations to study the $A_2$ theorem was a conjecture of Astala–Iwaniec–Saksman [1] concerning the special case where $T$ is the Ahlfors–Beurling operator

\[ Bf(z) := -\frac{1}{\pi}\,\mathrm{p.v.}\int_{\mathbb{C}}\frac{f(\zeta)}{(z-\zeta)^2}\,dA(\zeta), \]

and $A$ is the area measure (two-dimensional Lebesgue measure) on $\mathbb{C}\simeq\mathbb{R}^2$. This was the first Calderón–Zygmund operator for which the $A_2$ theorem was proven; it was achieved by Petermichl and Volberg [19], confirming the mentioned conjecture of Astala, Iwaniec, and Saksman. Another proof of the $A_2$ theorem for this specific operator is due to Dragičević and Volberg [3].

The powers $B^n$ of $B$ have also been studied, and then it is of interest to understand the growth of the norms as a function of $n$. Shortly before the proof of the full $A_2$ theorem, by methods specific to the Ahlfors–Beurling operator, O. Dragičević [2] was able to prove the cubic growth

\[ \|B^n\|_{L^2(w)\to L^2(w)} \lesssim |n|^3[w]_{A_2}. \]

Now, let us see what the general $A_2$ theorem gives for these specific powers. It is known that $B^n$ is the convolution operator with the kernel

\[ K_n(z) = (-1)^n\frac{|n|}{\pi}\Big(\frac{\bar z}{z}\Big)^n|z|^{-2}, \]

and it is elementary to check that this satisfies $\|K_n\|_{CZ_\alpha}\lesssim|n|^{1+\alpha}$ for any $\alpha\in(0,1)$. Moreover, since $B$ is an isometry on $L^2(\mathbb{C})$, we have $\|B^n\|_{L^2\to L^2}=1$. From Corollary 7.2 we deduce:

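7.5. Corollary. The powers $B^n$ of the Ahlfors–Beurling operator satisfy

\[ \|B^n\|_{L^p(w)\to L^p(w)} \lesssim |n|^{1+\alpha}[w]_{A_p}^{\max\{1,1/(p-1)\}},\qquad\alpha>0, \]

where the implied constant depends on $p$ and $\alpha$.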

Thus the cubic bound improves arbitrarily close to a linear one. It seems plausible that even the linear growth ($\alpha=0$) should be true, but this would require additional insight, probably specific to the operator $B$. This final corollary is new; it arose from my discussions with O. Dragičević in July 2011. In fact, this application motivated the formulation of the Dyadic Representation Theorem and the $A_2$ theorem with explicit dependence on $\|T\|_{L^2\to L^2}+\|K\|_{CZ_\alpha}$, which might also turn out to be useful in other applications to families of Calderón–Zygmund operators.